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Abstract 

This  thesis  presents  NL-Soar,  a  detailed  computational  model  of  human  sentence  compre¬ 
hension  that  accounts  for  a  broad  range  of  psycholinguistic  phenomena.  NL-Soar  provides 
in-depth  accounts  of  structural  ambiguity  resolution,  garden  path  effects,  unproblematic 
am.biguities,  parsing  breakdown  on  difficult  embeddings,  acceptable  embeddings,  immedi¬ 
acy  of  interpretation,  and  the  time  course  of  comprehension.  The  model  explains  a  variety 
of  both  modular  and  interactive  effects,  and  shows  how  learning  can  affect  ambiguity 
resolution  behavior.  In  addition  to  accounting  for  the  qualitative  phenomena  surrounding 
parsing  breakdown  and  garden  path  effects,  NL-Soar  explains  a  wide  range  of  contrasts 
between  garden  paths  and  unproblematic  ambiguities,  and  difficult  and  acceptable  embed¬ 
dings:  the  theory  has  been  applied  in  detail  to  over  1 00  types  of  structures  representing  these 
contrasts,  with  a  success  rate  of  about  90%.  The  account  of  real-time  immediacy  includes 
predictions  about  the  time  course  of  comprehension  and  a  zero-  parameter  prediction  about 
the  average  rate  of  skilled  comprehension.  Finally,  the  theory  has  been  successfully  applied 
to  a  suggestive  range  of  cro.ss-linguistic  examples,  including  constructions  from  head-final 
languages  such  as  Japanese. 

NL-Soar  is  based  on  the  Soar  tlieury  of  cognitive  architecture,  which  provides  the  underlying 
control  structure,  memory  •  tnictuies,  and  Icarntng  mechanism.  The  basic  principles  of  NL- 
Soar  aiv  a  result  of  applying  thu.se  architectural  mechanisms  to  the  task  of  efliciently 
conipreheridmg  language  in  real  time.  Soar  is  more  than  an  implenieutation  language  tor 
the  system:  it  plays  a  central  fhcoretica!  role  and  accounts  for  rnanv  of  the  rnoders  novel 
empii ical  predictions. 
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Chapter  1 
Introduction 


A^ovi',  ihe  ovenvhelmingly  puzzling  problem  about  sentence 
comprehension  is  how  people  manage  to  do  it  so  fast. 

—  Janet  Fodor,  Jerry  Fodcr,  and  Merrill  Garrett  (1975) 


FODOR,  Fodor,  and  Garrett  certainly  had  it  right  The  ability  to  comprehend 
language  in  real  time  is  one  of  the  most  complex  and  impressive  of  human  cognitive 
skills.  Eiqually  impressive  is  the  staggering  amount  of  scientific  effort  that  has 
been  devoted  to  exploring  the  processes  of  comprehension.  Few  topics  engage  so  many 
disciplines  within  cognitive  science. 

Over  the  past  three  decades,  psychologists  have  uncovered  regularities  about  aspects  of 
comprehension  ranging  from  lexical  access  to  memory  for  text.  Although  many  theories 
have  been  proposed  to  explain  these  regularities,  most  address  a  small  set  of  phenomena,  and 
only  a  few  take  the  form  of  complete  computational  models.  In  artificial  intelligence,  there 
has  been  more  concern  for  building  processing  models  with  increasing  functional  coverage, 
but  most  complete  NLP  5>y.stcms  still  do  not  model  any  appreciable  set  of  psychological 
phenomena 

A  notable  exception  is  the  READER  model  of  Thibadeau,  Just,  and  Carpenter  (1982), 
which  is  one  of  the  earliest  examples  of  a  complete,  functional  comprehension  system 
that  attains  some  measure  of  psychological  plausibility.  The  continued  development  of 
this  theory  (Just  &  Carpenter,  1992),  along  with  some  recent  theories  emerging  from 
linguistics  and  computational  lingui.stics  (Gibson,  1991;  Keinpen  &  Vossc,  1989;  Pritchett, 
1988;  Weinberg,  1993),  indicates  that  unified  computational  accounts  of  certain  a.spects  of 
sentence  comprehension  are  within  reach.  Each  of  these  theories  addresses  a  significant 
range  of  phenomena  with  a  single  set  of  mechanisms  or  principles  (a  (lisenssion  of  these 
and  other  theories  appears  in  Chapters  2  and  9). 

I'his  thesis  lakes  anotlier  significant  step  toward  a  unified  theory  of  sentence  i.ornore 
iiension  fiy  presenting  a  cumpulalioiial  inode!,  NT..-, Soar,  that  .satisfies  the  li)!lowmg  goals: 

i.  Hu'ddih.  ’['lie  itn'ory  models  a  '.vider  i.inge  of  psvehologn.  a!  (ilienomena  diat'i  lias 
previoiisiv  tHHai  given  a  eoliesive  aeeount 
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2.  Depth.  'T’hc  theory  models  the  phenomena  with  a  depth  matching  or  exceeding  the 
cut  rent  best  theories  tor  tiiose  phenomena. 

3.  Architectural  basis.  The  theory  is  embedded  in  an  independently  motivated  theory 
of  tlie  cognitive  architecture. 

4.  Fimctionalit}'.  The  theory  functions  as  a  working  comprehension  system. 

The  remainder  of  this  chapter  elaborates  these  goals  by  providing  an  overview  of  the  target 
phenomena,  an  explanation  of  what  it  means  for  the  theory  to  be  architecturally-based,  and 
a  preview  of  the  theory  and  major  results.  The  chapter  concludes  with  a  reader’s  guide  to 
the  remainder  of  the  thesis. 

1.1  The  core  sentence-level  phenomena 

NL-Soar  addresses  six  kinds  of  phenomena  that  form  a  cluster  of  regularities  at  the  sentence 
level.  The  phenomena  are  primarily  about  the  on-line  processes  involved  in  piecing  together 
the  words  in  a  sentence  to  form  a  meaning.  Though  NL-Soar  necessarily  embodies  some 
plausible  assumptions  about  lower-level  processes  such  as  lexical  access,  and  higher  level 
processes  such  as  creating  a  long-term  memory  of  the  comprehended  content,  the  theory 
does  not  yet  model  the  phenomena  at  these  levels  in  significant  detail.  However,  the 
sentence-level  processes  and  the  phenomena  surrounding  them  form  an  important  core  that 
must  ultimately  be  addressed  by  any  comprehension  model.  The  phenomena  are; 

1 .  Immediacy  of  interpretation  and  the  time  course  of  comprehension.  Our  subjec¬ 
tive  experience  is  that  we  comprehend  language  incrementally,  understanding  each 
word  as  it  is  heard  or  read.  As  a  hypothesis  about  the  comprehension  process,  this 
has  been  advanced  as  the  principle  of  immediacy  of  interpretation  (Just  &  Carpenter, 
1987),  and  much  experimental  evidence  has  accumulated  in  support  of  it.  In  general, 
immediacy  holds  for  all  levels  of  comprehension — syntactic  parsing,  .semantic  in 
terprelation,  and  reference  resolution.  Furthermore,  this  munediate  comprehension 
happens  rapidly,  at  an  average  rale  of  ''■240  words  per  minute  in  skilled  reachag. 
Although  the  average  time  per  word  is  ms,  eye  fixiilion  studies  also  reveal  that 

fixations  range  from  as  little  as  SO  ms  to  HXK)  ms  or  more. 

2  Ambiguity  re.solutiou.  VV'heo  reade^^  oi  listcneis  encounter  an  ambiguiiv,  how 
do  lliey  deeide  vvliieh  mtei pretatioii  to  give  u'  A  theory  ol  coiigtreheiision  must 
specily  wtiai  .knowledge  is  brought  to  beat  m  resolving  ainbignities,  and  tiow  and 
when  lha!  knowletige  is  luoughi  to  ticai,  lliere  are  several  kinds  ot  .unl'iguities 
tfiat  .irise  in  coinineheiisioii.,  raiisong  from  ierical  scinaiitn.  to  leteieniiah  bni  here 
V',  e  [inniani\  lieus  on  sinu  tiuiii  ainbigintv  ahrrrialixf  nitei  pieialions  dial  anse 
because  the  parii.d  iiit-arance  is  consistent  Vviili  mullipli.  svm;K  tic  p.ii mss  i  i  t  lH'lo\^ 
goes  a  sinijde  ev.iinple; 

(  !  )  ’!  he  cn|i  saw  the  ilog,  w  ifh  ihe  inru.K ail.iis 
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Sentence  ( 1 )  exhibits  a  structural  ambiguity;  the  prepositional  phrase  with  the  binoc¬ 
ulars  can  attach  to  saw  or  dog.  General  knowledge  may  prefer  to  interpret  binoculars 
as  the  instrument  of  the  seeing,  but  in  certain  specific  contexts  the  binoculars  may  be 
associated  with  the  dog. 

The  empirical  evidence  concerning  the  knowledge  sources  used  to  resolve  ambiguities 
is  mixed.  .Some  studies  have  demonstrated  that  the  semantic  content  of  the  sentence 
or  the  established  discourse  context  can  have  an  effect  on  the  on-line  resolution  of 
local  ambiguities.  Other  studies  have  shown  the  lack  of  such  effects,  demonstrating 
instead  an  apparent  preference  for  one  syntactic  structure  over  another,  independent 
of  content  or  context. 

3.  Garden  path  effects.  A  garden  path  effect  arises  when  a  reader  or  listener  attempts 
to  comprehend  a  grammatical  sentence  v/ith  a  local  ambiguity,  misinterprets  the 
ambiguity,  and  is  unable  to  recover  the  correct  interpretation.  The  result  is  an 
impression  that  the  .sentence  is  ungrammatical  or  nonsensical.  (2a)  below,  due  to 
Bever  (1970),  is  the  classic  example.  Raced  may  be  taken  as  the  main  verb  of  the 
.sentence,  or  a  relative  clause  modifying  horse.  The  relative  interpretation  is  globally 
correct.  ((2b)  has  a  parallel  structure,  but  driven  is  unambiguous,  so  the  garden  path 
is  avoided.) 


(2)  (a)  The  horse  raced  past  the  barn  fell. 

(b)  The  car  driven  past  the  station  stopped. 

The  subjective  experience  provides  compelling  linguistic  evidence  for  the  unaccept- 
ability  (jf  these  .sentences,  but  additional  experimental  evidence  comes  from  reading 
times  and  granimalicality  judgments.  Tlie  reduced  relative  construction  in  (2a)  is  hut 
one  kind  of  garden  path;  a  collection  of  26  dificrent  types  is  pie.sented  in  Chapter  2. 
I'hough  (he  garden  path  effect  has  [x-en  well  known  since  Bever’s  (B>70)  article, 
Prilclu'tt  (  was  the  In. si  to  deal  m  depth  with  the  vai  ictv  of  constructions. 

d.  l.Ii3[irobleinatic  ambigiiitie.s.  .Some  local  ainbigiiilies  do  not  i.  arise  difficuhv  no 
matier  whieh  iiiterprelation  proves  to  be  tlu;  eloltallv  eruiec!  one.  ( 'oiisidei  'he  [lan 
eh  sciili'iici's  111  ( ) 

(hr  I  (a)  1  know  .loliii  \ei\  wi  ll 

(b)  1  know  lohn  went  to  the  s(o!e 

riieie  IS  a  hu  al  ainhiginlv  at  John,  siiiec  U  v  ould  citla  i  iv  lia,-  diiecl  ofi|cci  oi  know  oi 
i !  ic  ait '  ICC  I  ol  an  incoming  r  Inn  sc.  'i  i.a .  u'l.'.iul  lc^^  ol  1 1  ic  !  i  n.il  caiU.  >  m  n  ■ ,  1 1  ic  sen  tc  1 1.  c 
I  .ii,!  Si's  no  pci  cepi '  hic  pi .  K  ess  mg  it  i  ( la  u  it  !  Ik‘i  c  ,iu'  a  \\  iik  ^  .iiic;  \  ol  i  oust  i  net  loi  i  s 
\‘,  nil  mipi  ol'li-m.ilu  loc.il  .1  min  gill  lies.  (  li.i[>lci  |  a  i' m  ails  ,i  eollcv  I  ion  ol  I  ililU'i  cm 
kihils  1  liCM'  consdiKlioiis  pioNiilc  ailiiitioii..il  coii'ai.iini  P.i  a  ihcoiv  inicinicil  ii' 
model  ;.',udi'i!  pnlli  cllciis  liu  pi.isiicil  mcch.nusiu  niust  Iv  enoui'h  lo  piediei 

d 1 1  ii.  u 1 1\  Oil  i'.ii  .it'll  p.tihs  ini!  no!  so  \\  cak  ili.il  It  V  .nil u a  pn  ,e 1 1 1. •  1 1 n |  n  ,  ,1  gf  1 1 ..u  k 
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5  ■  Parsing  breakdown  on  difficult  embeddings.  Some  constructions  without  structural 
ambiguity  are  difficult  for  people  to  comprehend.  The  effect  is  similar  to  a  garden 
path,  but  the  source  of  the  difficulty  is  not  recovery  from  a  misinterpi  eiation;  instead, 
it  seems  impossible  to  parse  the  sentence  at  all.  Consider  the  following  center- 
embedded  sentence: 

(4)  The  dog  that  the  man  that  the  cat  cha.sed  bit  ran  away. 

Such  structures  indicate  that  there  is  .some  kind  of  severe  limit  on  the  human  capacity 
to  handle  certain  kinds  of  embeddings.  As  with  garden  paths  and  unproblematic 
ambiguities,  there  exists  a  range  of  such  structures,  of  which  (4)  is  just  one  kind: 
Chapter  2  presents  a  collection  of  17  different  types.  Gibson  (1991)  w'as  the  first  to 
deal  in  depth  with  this  variety. 

6.  Acceptable  embcdding.s.  Complementing  the  structures  that  cause  parsing  break¬ 
down  are  those  embeddings  which  do  not  cause  such  a  breakdown.  For  example; 

(5)  The  dog  that  the  man  bit  ran  away. 

(6)  That  the  food  that  John  ordered  tasted  good  pleased  him. 

Sentence  (5)  shows  that  the  structure  in  (4)  becomes  acceptable  with  one  less  embed¬ 
ded  clause.  (6)  is  an  example  of  a  fairly  complex  embedding  (involving  a  sentential 
subject)  which  is  nevertheless  acceptable.  A  collection  of  26  acceptable  embeddings 
is  pieseiUed  in  Chapter  2.  Such  structures  constrain  theories  of  parsing  breakdown 
just  as  the  unproblematic  ambiguities  constrain  garden  path  theories. 

The  eviclencc  for  these  phenomena  comes  from  work  in  speech  comprehension  and  reading, 
.since  thev  arise  in  both  skills  (in  fact,  cross modal  techniques  are  an  important  source  of 
data).  Although  there  are  mdcperuieiit  issues  as  well  -lot  example,  control  of  eye  nurve- 
meiUs  IS  not  .is  crilu':il  m  speech  eom[iieheris:on  as  in  le.uliiie-  a  leasonahle  a.ssumplion 
IS  that  a  shared  suirsei  of  ilie  ee-mineherisioii  processes  mideilv  iliese-  sliareil  phenomeiia. 

I liesi,'  conuncu!  [iioeesscs  arc  uhai  NI  .So.i;  is  iiileiideil  to  nioilel. 

1.2  ArcliitecturaJ  ba.si.s 

In  the  l'>S  j  \\  illi.iin  l.uiies  !  eeluies,  .Aneii  N'l.'vvell  issiRsl  .i  e.ill  toi  i heoi el ,u,  ,il  uiiiliiMliun  in 
ps\  r  ln.i|i );' s  Newe!!  x-,  .is  ei'iieeiiied  wiili  the  |nolilei,iluiii  ul  mu.  lodieoiies  in  ps\ ei  u  ilog  x . 
and  (illeiisl  ihe  de  \  e  it  )[m  lenl  oi  <  ns',s';;i,--  ,iVi /u.C-'i /a/.- .s  .i\  ihc  |si:ti  lo  .i  iiuue  emiuila 
ioe  seieiivc  \  tsuMiiiiv  v.-  ,ii._  Inleeinie  is  the  li  xCei  v:oinpul,!lU'n,il  siiMi,!ine  lii.il  sllpp^.r!^ 
Si  s: '  111 !  li '!  1  II  siu'silu',  IlK'  SDlltlii!  s!  I  nslii!  es,  i  nei  i  h  nii"..  .uivl  piliiiilixe  j'liHs-  as  uiuieip, 
III;’  ,il  I  Ids  i  1 1  i !  \  e  I  u[:.i\  loi 

IJifiieli  ihe  iiiUion  ed  .nsliileetim.'  w sn-  not  new  in  IdN  iNeweli  ,V  Sniion  Id  '.', 
('.lid  (•(  ,i!  .  IdS'',  .Viuleison,  Id.Sv  I’xK'.Imn,  lOS-li  .x,e\xe!l  esl.ihl ished  ilie  le.rsdiliix 
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sweeping  the  time  scale  of  human  behavior  from  immediate  reaction  tasks  (hundreds  of 
milliseconds)  to  long  stretches  of  problem  solving  behavior  (minutes  or  more).  He  also 
provided  the  necessary  methodological  tutelage,  demonstrating  how  to  take  an  architectural 
theory  seriously  and  how  to  apply  such  a  theory  in  a  variety  of  ways. 

The  theory  Newell  used  as  an  exemplar  was  Soar,  an  integrated  architecture  developed 
by  Newell,  John  Laird,  and  Paul  Rosenbloom  (Laird  et  al.,  1987).  Soar  is  a  problem 
space  architecture  that  combines  a  long-term  parallel  recognition  memory,  a  declarative 
working  memory,  an  open,  flexible  control  structure,  and  an  automatic,  continuous  learning 
mechanism.  The  integration  of  problem  solving  and  learning  capabilities  into  general- 
purpose  architectures  has  been  a  focus  of  recent  research  in  artificial  intelligence,  and  Soar 
is  but  one  system  among  many  (Laird,  1991).  However,  most  AI  architectures  are  not 
proposed  as  psychological  theories  (e.g.,  prodigy  (Carbonell  et  al.,  1989)),  In  psychology. 
Soar  joins  the  ranks  of  other  architectural  theories  of  cognition,  such  as  C.APS  (Ju.st  & 
Carpenter,  1987)  and  ACT*  (Anderson,  1983). 

The  comprehension  theory  presented  in  this  thesis,  NL-Soar,  is  built  on  the  Soar  archi¬ 
tecture,  and  grows  out  of  the  theory  Newell  sketched  in  the  lectures  and  later  in  his  book 
(Newell,  1990).  What  does  it  mean  for  NL  Soai  to  be  grounded  in  the  Soar  architecture'.’ 
It  means  that  NL-Soar  specifies  how  the  functional  requirements  of  comprehension  arc 
computationally  realized  within  the  architectural  mechanisms  of  Soar.  NL-Soar  takes  the.se 
mechanisms  as  given — theoretical  liypotheses  independently  motivated  by  other  functional 
and  empirical  considerations.  Grounding  Is'L-Soar  (or  anv  other  cognitive  theory)  archi¬ 
tecturally  also  means  it  is  possible  to  deal  adeijuately  with  tjuestions  about  what  fiart  of 
die  computational  model  sluuild  be  taken  as  carrying  theoretical  lontent,  ami  what  pari  is 
simply  implemematioii  detail  (Pylyshyn,  Id8-1;  Newell,  1990) 

I  hiis,  the  1  act  tliat  NL  Soar  is  embedded  in  Soar  is  not  an  implemeiitaiiona!  sKienote 
to  tins  tliesis,  but  c.uiies  iheoietical  content  fhe  lot.il  theoiA  is  .i  comt'nnaiion  ol  tlie  Soar 
aicliitectuie  .ind  the  adilitioiial  content  speciiied  bv  Nl,  So.ti  that  reali/es  tlie  timctioiis  ol 
compielu'iisioii  Wo!  king  w  itlim  So.ii  .ilso  slieds  new  liglil  cai  old  issues  such  ,.is  iiiodiil.ii  il\, 
and,  .IS  v\e  shall  see,  i.iises  compleleiv  lunel  issues  as  well  (  h.uMeis  '  and  o  v  dl  deal 
exi'licilh  w  nil  the  lole  ot  So.ii  m  die  theoic  .u\d  ils  jnedici ions 


1.3  An  iiiiploniiiUecl  systpiii 

Nl  S>'.i!  c-  ,m  inipl'.iuciili'd  natui.il  l,iii'ma''.c  sx-tein  iiinniiu'  m  Si.miO  iw.hich  r.  iiii)'!c 
!  i  u'liled  1 1 1 1  I,  ,iik1  1  s  I  >ci  lie  Used  i  w  se  \  ei  ,il  i  escai  ciu'i  s  lu  I  lie  So.ii  eoi  iiinun  i  ( x  i  see  (  ii.i|'tei  ' ) 
f  o  I  po!  li  1  e  I  s  So  I  li  1  s  I  a  fie  I  XX  ol  k  i  1 1  ,  onsi  si  s  >  >!  .i  -.e!  ol  '•  'O  s,  i  e  i  a  ;  id  1 1,  i  lous  .aid  i  iii  m  e  1 1  r  ii  i 
doul'k:  s  III  si.'e  .e.  .1  lesull  ot  le.iimiie  All  oi  ihe  ex.uiiples  pieseuievi  m  dii  i  en  i.i  1 1 1  ii  i  ' 
el  I  .)|  '1  e  1  s  h.i  \ ;  bee  11  i  u  u  llii  oii;/  h  die  sx  sli'iu  ui  i  le  ss  e  \  p!  k  1 1 1  \  i  lole,  i  oi  I  lei  i  ■  e 
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Chapter  I.  Introduction 


1.4  Brief  preview  of  theory  and  major  results 

The  NL-Soar  model  realizes  the  functions  of  real-time  compiehension  with  a  small  set  of 
operators  that  perform  syntactic  parsing,  semantic  interpretation,  and  referential  processing. 
The  control  structure  is  a  mix  of  serial  and  parallel  processiitg,  and  is  open  to  modulation 
by  multiple  know  ledge  sources.  The  comprehension  skill  is  held  in  a  long  term  recognition 
memory  that  generates,  selects,  and  applies  comprehension  operators.  Most  of  the  pro¬ 
cessing  happens  by  immediate  recognition  (automatically).,  but  NL-Soar  can  always  break 
out  into  deliberate  processing  as  necessary.  As  a  result  of  roar's  chunking  mechanism 
comprehension  is  a  continuously  improving  mix  of  deliberation  and  recognition  behavior. 
(In  fact,  all  of  NL-Soar’s  recognitional  operators  arise  via  chunking.)  Misinterpretations 
are  corrected  by  an  on-line  repair  process,  w'hich  is  constrained  to  ensure  computational  ef- 
hciency  in  the  problem  space  search  and  recognition  match.  The  partial  syntactic  structure 
in  w  orking  memory  is  efficiently  indexed  in  a  manner  that  also  ensures  efficient  recognition 
match. 

The  basic  properties  of  NI.--Soar  and  the  Soar  architecture  interact  to  account  for  a 
wide  range  of  sentence  processing  phenomena.  The  model  predicts  a  variety  of  both 
interactive  and  modular  ambiguity  resolution  effects,  NT  Soar  provides  the  first  explicit 
niodel  oi  deliberate  recovery  from  garden  path  effects.  In  addition  to  accounting  for  a 
number  of  other  qualitative  phenomena  surrounding  garden  paths  and  parsing  breakdown, 
the  motiel  successfully  makes  detailed  predictions  on  a  collection  of  over  HX)  different 
garden  path  structures,  unproblematic  ambiguities,  difticuit  embeddings,  and  acceptable 
embeddings  The  collection  is  primariiy  Hnglish.  hut  includes  a  range  of  cross-linguislic 
Items,  uicliiding  beat!  tinal  constructions  The  nuxlel  also  makes  a  several  (|uantitative 
[iredi  ,  lions  concerning  the  time  course  oi  comprehension,  including  the  hrst  /ero-parameter 
[uedi .  lion  ol  com|irchension  late. 


1,5  Reader’s  piiide  to  the  thesis 

I  he  thesis  has  three  ni.i)or  [xuls  phenomena,  tlieoiv,  and  iheorv  .q’lplication  1  he  [ihenoiii 
eii.i  .ae  levieuc-ii  in  t'liaplei  .liong  w  ii'i  .i  desei  ipi h.mi  oI  I'lev  nuis  tlieoiies  (  h.ijuei  ' 
piesi-nts  the  Nl  ,So,ir  theorv  in  det.ii!  1  he  ..ipplie.ituin  ol  ifie  ilu'orv  to  the  senleiu c 
pnx'evM  lie  plienomeiia  is  pie  sente  d  m  ( 'li.i(>li.'i  s  d  S  (,  liapter  d  pn  o  ides  .t  ''ii:  i  ini.n  v  ot  tlie 
inotlel  ,n;d  the  |>'eilk  tions,  ,is  ueil  .is  .i  ^iiscussuni  o!  geneiai  theoieiu  .il  issues  ,uul  ,i 
I  loseU  rei.iied  models  I  oi  die  one  Miiiiute  version  ol  the  thesis,  look  at  i.tl'les  1,  o 
•iiid  '.f  ; 

(  h.ipiei  iii.iv  I'v  ,i)'pio,uhed  in  sevet.il  dilleo.nu  vv.ivs  It  voui  sumI  is  lo  olii.uii  |usi 
A  hnel  o\  c!  V  lew  ol  die  ptieno, neii.i  llK'insei \ es,  (fie  liiuil  sect loii  pi ov  ides  ,i  siumnai  \  I  he 
t.ihie-  ID  ihv-  (iiid.dle  i  I  die  ch.iplei,  iisinii:  the  g.iideii  ('aih  .u'd  p.usme  hie.ik.low  u  e\,irn|'les. 
ill,!',  .ilso  he  tie  I  pi  I!  I  1 '  \ '  'll  ,11  e  luleiesicd  ni  llie  l  heoiei  ie,,l  w  of  k  .  \  >  'ii  .n  i  u  -e  du'  sii  nu  n.ti  v 
SIS  I  h  III  !o  1 ,11111 !  i.ii !/('  \  I'lii  se  1 1  V,  iih  the  h.isu  pheiiiiiiien.e  dien  i  is;  !  i he  I tu  i  i  v  disi  usmom  ■. 
.tl  ilii  ei'id  ','1  esu  h  lii.iiiii  seelioii 


Chapter  2 

Human  Sentence  Comprehension: 
Phenomena  and  Previous  Work 


All  theories  live  under  (he  same  sword  of  Damocles. 
—  Alien  Newel! 


MODEPvN  psycholinguistics  began  with  a  concern  for  the  relationship  between 
sefitence  comprehension  and  the  grammars  of  linguistic  theory  (Miller,  1962), 
and  since  then  sentence  processing  has  remained  one  of  the  most  active  areas  in 
the  held.  Given  the  tremendous  arriount  of  accumulated  work,  it  is  impossible  to  provide 
a  complete  review  here.  Instead,  the  primary  purpose  of  this  chapter  is  threefold:  1 ) 
Establish  the  phenomena  relevant  to  the  model  presented  'm  Chapters  3-7;  2)  Review 
previous  theorie.s  proposed  to  explain  these  phenomena;  and  3)  Motivate  the  choice  of 
phenoinena  by  demonstrating  how  they  provide  great  leverage  in  identifying  the  central 
mechanisnis  of  sentence  compre.iension. 

/■V  section  is  devoted  to  each  major  plienontenon,  and  each  section  concludes  with  a  re¬ 
view  of  tile  lelevant  tlieones.  Each  theory  is  discussed  only  with  respect  to  the  phenomenon 
of  uueresi,  'Tins  orgaiiizatior!  has.  Iht:  di.'-iadvantage  that  ihe  discussion  of  any  paiiicidar  ilie 
ory  may  be  distributed  across  seveta!  sections  flowev?’-!,,  a  few  theories  closely  related  tc> 
NI.,-St>ar  in  go.ils  and  content  (particularly,  Gibson  {1091),  Just  N.  Carpenter  (1992),  and 
Erilchctt  ( will  Ite  examined  again  .m  CftiipSer'-), 


2„  1  I'he  produtls  of  co 


Hri\>ic-  csaimiiing  (he  ph'seonuma  surroin'idmg  sentence  processing  ss'C 

-iumlil  tits'  cnri -uiJe;'  die  Innctirma!  reipiiicmcnls'  Wh,u!  >lov.‘s  ccanpicficssioii  nceci  to  do" 
(  'LiJ'k  A  t.'iari,  i  1 97  7  I  gi\  c  fa  it  answers.  In  the  naruiw  sense,  vdis:  b  pjc-s- 
coSiCMchfasiisn  serscs  to  t'Uih,!  a  rcp'-csemuiion  ot’  the  me.mtu:-  ^;■f  die  iingtusnc  mriat  7  iie 
;iri);uk:r  ssiisc  ol  cosnptc'tension  includes  u(ili:'c!Uiin.  v.-m  h  leJcrs  to  vshul  tlic  iistorKT  ov 
rvsuici'  de-i.'-,  s\s!!i  tb-;'  oreaumn  (;sk:c  :s  g!'asjKs.t . c  g  store  it  ii;  oe'ivioiv,  t>e'ii.-se  it,  lIs  n 
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(if  it  is  an  instruction),  answer  it  (if  it  is  a  question).  While  keeping  in  mind  the  warnings 
of  Clark  and  Clark  thai  the  processes  of  construction  and  utilization  are  often  not  clearly 
separable,  the  remainder  of  this  chapter,  and  most  of  this  thesis,  treats  comprehension  in 
the  narrow  sense — as  the  construction  of  a  meaning  representation. 

This  section  reviews  evidence  for  tiiree  kinds  of  representations  produced  as  final 
outputs  or  as  intermediate  by-products  of  sentence  corr'prehension:  a  referential  model,  a 
context-independent  semantic  encoding,  and  a  syntactic  structure.  The  evidence  will  range 
from  purely  functioi'al  to  empirical. 

2.1J  Referential  representation 

We  have  taken  as  given  that  comprehension  produces  a  representation  of  the  meaning  of 
the  linguistic  input.  But  we  need  to  be  careful  about  v/hat  is  meant  by  a  repre,sentation  of 
nieaning,  since  philosophy  and  linguistics  have  traditionally  used  the  term  meaning  in  a 
restricted  way. 

Sense  and  l  eference 

The  standard  conception  of  sentence  meaning  traces  its  roots  back  to  a  distinction  made  by 
Frege  (1892).  According  to  Frege,  linguistic  expressions  have  a  sense  and  reference.  The 
reference  of  an  expression  is  what  that  expression  denotes.  The  sense  of  an  expressior.  i.s 
the  manner  in  which  the  reference  is  made.  The  distinction  can  be  seen  most  easily  in  noun 
phra,ses: 


(7)  (a)  The  most  famous  professor  at  Carnegie  Mellon 

(b)  Herbert  Simon 

Both  expressions  refer  to  the  same  Nohel  Tauieate,  but  in  a  diflereiu  manner.  Because 
then  denotation  i.s  tlic  s.itne,  the  (.‘xpressions  can  be  interef  anged  in  some  Muitences  witliout 
ali'ecting  the  truth  value  of  the  sentences: 


(8)  (a)  Alonso  Inul  kiiich  with  Herbert  Simon. 

([>)  Alonso  had  luiich  wi:h  t.ic  inosi  ,!'i;;.niO’i,s  piofessor  at  Carnegie  Mellon. 

However,  laege  penniei!  oiil  lliai  a  [uiiely  detioi'.<ttio(i;.ii  .'iccount  ol  senk'isc.."  Uie.a.nmg  is 
unu oiKahle.  (  'miMdcr 

O')  ipll  (iniiks  he  !s  ihe  most  taivioiis  [irolcviOr  at  f 'arnegb.'  Meilom 

Seine  I  ice  ( 'fi  cic.ii  i\  doi-s  no!  mean  I  hat  Hil!  llmiks  he  is  i  lethci  a  Sinujn  ri;,.;  i  v.'o  slateiT.ccn-' 
..iic  InmcallN  mi,l-.  jieihlelil 

the  I"  tioii'  ol  '■eiiNC  ,iihi  leleieiK''  !ia\e  iiiH'  lire  oa'i:,  piei.'i'.i;'  wificcpl:-;  of 

:r:!rr.\’(ni  ,md  c Ot /'.-./'.'o  in  bnmai  semaiC.cc  ti  nu-o,  logyi,  iii  scin.iiitics  fiasco  on 
[)(0. .  i i lie  v\ !  M  k  's  :  m  -a .n-'s  .,)i  anairs '  liu;  mtcnscci  of  a  fViedi,:'.\ie  i sac-  o  o;  /./cr/cc .' c, ,,  vi ) 
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is  a  function  from  possible  worlds  to  sets  (.)f  objects.  Tlie  extension  of  a  predicate  in  a 
particular  world  is  the  set  of  objects  m  that  world  which  are  selected  by  the  predicate;  the 
objects  are  the  result  of  applying  the  intensional  function  to  the  given  world.  Similarly.,  the 
interrsion  of  a  sentence  is  a  function  from  possible  worlds  to  truth  values.  In  other  words, 
the  intension  is  what  determines  whether  the  .sentence  is  true  in  a  particular  situation.  The 
extension  of  a  .sentence  is  then  usually  taken  to  be  a  truth  value. 

Given  these  definitions,  the  traditional  view  in  philosophy  and  linguistics  is  that  the 
meaning  of  a  sentence  corresponds  to  its  intension,  or  truth  conditions  (Lewis,  1972).  Yet 
it  is  clear  from  examples  such  as  ^8b)  that  meaning  in  this  sense  is  often  not  ali  that  is 
important.  Depending  on  the  listener’s  knowledge,  what  is  being  ccmmunicatcd  in  (8b)  is 
not  just  that  Alonso  had  lunch  with  the  most  farnou.s  professor  at  CMU  whoever  that  is.  but 
that  Alonso  had  lunch  with  Herbert  Simon.  This  leads  to  our  first  functional  requirement 
for  comprehension: 

Comprehension  builds  a.  referenthrl  representation  which  contains  information 
about  the  particular  referents  of  the  discourse  in  titc  pariicuiar  situation  de¬ 
scribed  by  the  discourse — not  just  the  .set  of  truth-conditions  of  the  constituent 
sentences. 

From  an  A1  point  of  view,  this  is  a  somewhat  obvious  assertion  to  make,  because  the  useful¬ 
ness  of  a  referential  representation  has  been  established  since  SFRDLU  (Winograd,  1972). 
Nevertheless,  this  functional  requirement  wa,>  not  alw'ays  appreciated  in  psychology'. 

Work  in  psycholinguistics  has  led  to  an  additional  claim  about  the  referential  repre.'mn- 
t  at  ion; 


The  referential  represenlatic<n  is  what  is  retained  as  the  piiniar>  and  final 
product  of  comprehension 


A  form  C'f  lliis(.:laiiii  lirst  rJiowed  up  first  a.s  the  const  me!  ive  ilieory  of  Rran.sford.  Barclay, 
and  Franks  ( 1972).  They  prop(;sc(i  that  comprehension  constrnct.s  a  rcpreseniation  of  the 
described  situation  that  integrates  the  iii!<>rn;a(i(.>n  cxphcil  in  the  input  aiong  with  infc'rcd. 
or  background  information.  Jlie  result  is  a  single  cul-ercnt  repii'scniation  of  the  Luutcnl, 
111  tlieir  experiments,  .Mibjcefs  ,'v.ul  slion  paragr.iplis  < irsta dung  Muij)'  ■  eonlignratimis  of 
olijcets,  such  a'  (  lb): 

(  lb)  Three  tiirlles  resrcd  on  a  tluaune  log,  .»  id  a  lisli  swam  In-n  ath  them. 

On  subsequent  rcfa-ciiition  tests,  tbev  fonnd  tinil  sutijoets  could  no'  .n  .  nmu  f,  distnignisli 
botwean  sentences  that  v'.'em  prcM.-'n  in  tfic  ,:i.iram.;[»ii  and  duisc  !ti,i!  sseie  hu  m  !\  s  s/UM  di  ia 
vridi  ihe  siUi.iiion  (.icsenbed  In  die  ‘  erneuecs.  Wtiai  w.C'  iei.ame<l  '  iiot  the  ad  it  imaii 
m  s.dia'l'!  the  ii'doi m.tf ion  s^as  pi .'serUed.  '.'r  .i  s;-;  o!  scnn,-ii'  i.ieani!!:,'v.  bi.u  ,i  i '.'gn !.a!ti,t! 

fS;':;'.  i s  a  i;  X  .1: :  I  pu'  Uu' ‘  -.'lu'il'.n -lo  tii'isi  ,  u!  K  intsi.  h  .X  '..HI  i.'li.l.  I  .■■i.'al'  a,;'  I  ic!  i.  ■  ■■' it.ii  i '  .) 

leii  ■ .  ii  ti-it  :  ..a  iU-'A'ai  :j(e  ,i.  am!  Us  snt'aci.pit  la  ,  '■■■  isjui  iji  inai  ,i  Si,-)  ii'  la,  !  >  mi,  e  ,i  '  I'p a-'i: "  'i  ■-  .ta 
ntaa'Cani  ti  ir  V'.s.  g.  Kiao-eli  i'C'  C 
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ref)re:icntati('/n  of  the  Jcscribcd  r.ituation.  in  .snTiiJar  W'ork,  sevet  .U  studies  Stave  sSiov/ti  that 
suStjects  often  confuse  different  NPs  th-it  refer  to  the  satne  indivicittai,  indicating  agau! 
that  an  e.xfensi.onal  representation  is  retained  (Anderson  &  Eiower.  1973;  Anderson.,  ! 
Gari,>}ta;'n,  1987), 

Tilt?  form  of  rep  rest  rd-ation:  models  vs.  logics 

The  clalnt  tlrrl  cornprrhension  produces  a  referential  representation  is  a  kitovr’ledge-leveh' 
claim.  It  asserts  that  infotnialiori  'i.s  encoded  about  the  particular  referents  in  the  discourse, 
pos.sibiy  along  with  previously  known  or  i.nferred  information.  Bur  it  does  not  say  how  that 
information  is  encoded. 

One  possibility  is  that  the  repre.sentation  i,s  a  logic  of  some  kind.  For  e.'iampie.  if  the 
predicate  calculus  is  used,  then  k.no’.viedge  about  the  referents  is  encoded  as  a  set  ot  firsi.  order 
statements,  and  reasoned  with  !;>y  applying  the  i.n1ereniial  rules  of  the  calculus  The  way  the 
staiement,s  represent  is  defined  by  the  inodeb- theoretic  semantics  of  the  logic:  the  stideaicnts 
represent  a  particular  .siiuaticn  if  that  situation  is  a  model  of  the  statements  (Ebbmghaus 
et  ;il.,  i98‘i)  Th.is  correspondence  is  hahrect.  The  eleracnts  of  the  representation- -the 
variables,  r, connectives,  quantifiers,  etc. — do  not  neccs-sarily  map  d.!rectly  onto  the  domain 
being  represented. 

•Tiler.nat'Veiy.  tlierc  can  be  <i  direct  correspondence  between  parts  the  representation 
and  parts  of  the  repre:5ented  situation.  Individuals,  properties,  and  telaiicns  in  the  repre¬ 
sentation  can  map  directly  onto  .individuals,  properties,  and  relations  m  the  domain.  Such 
a  representation  is  not  a  logic;  .lohnson-Laird  (1983)  calls  it  a  mental  moael. 

Although  the  comprehen.sion  experinsents  discar.sed  ear.ber  csiatilisli  the  reality  of  a 
referential  representation,  they  cannot  help  settie  i.ssucs  aboul  the  nature  of  that  represen¬ 
tation.  To  distinguisii  logic  and  inenta.l  rncdels.  what  i,s  needed  i.s  a  study  of  tasks  that 
ta[  more  f  ectly  into  those  aspects  of  logic  that  violate  stiuctsire  cofi  espondencc.,  such  as 
qur  tification  or  disjunction.  <)ver  the  past  decade  o.i  more.  Johnson  l.adrd  and  hi,s  co! 
leagues  Irave  carried  out  just  such  a  rc.scarch  [ircigrarn,  studyin,!.'  subjects  i'ierior.mii!g,  explicit 
ivasonmg  tasks  such  as  caiegoricai  syllogisms.  'SJicse  tasks  are  scicvant  so  d  e  issua.'  of  vvhai 
coiriprclicnsion  produces  because  llicy  iin  oive  ihc  compsciicrcMon  oi  premises  piescnted  m 
ii.mna!  laugintac, 

.lohnson  1  .aird's  larnclusuai  that  luim.iiis  arc  not  using  a  logic  based  icpiC'^cnlaOria 
(  hrlinvoii  f  aiid.  IbS.t;  Joimson  i.aird.  1^88),  '1  he  coiV.insu.Hi  is  based  nnruirth'  ,sn  'he 
[sutma  of  ciroi's  that  Milyciils  jirodnce  winch  am  imt  consislcnt  wim  a  prom  iKiscd  accomii 
o!  lir.sr  n:asoiMng.  instc.ul,  the  msitlis  are  consis'cni  \v,di  a  irpn  senlal aaia!  ihcoiv  hna  can 
[>!■  siiniiii:a  i/ed  (  iii  (Wi  I  i  as  to|iov.'s 


t  \)nipK'!ii.'iisnin  (ardds 

i.‘iiii,ii  mpii s.einiau'ii!  •  d 


i  ioeitlal  rnrKici  ra  the  Imemsuc  iiijan,  '.'.  fni  [i  is  .>  rcic' 
.!  'unr  iPiO  ^OitCOisi!  ilea  r-.  conMacni  w'di  fu.  mra!: 


ITic  f'li'oikuAs  iil  comprchensioli  1  | 


"1  he  elcivicnis  of'fhe  rcprcser-tatioii  correspond  one  l:o  -one  to  elements  of'ihe  sit¬ 
uation.  (I’omprehcnsion  typictilly  p-oduces  just  one  mc.idel,  cveri  whcii  multiple 
models  are  possible. 

The  concept  of  a  refex'ential  representation  th;it  takes  Use  Corn',  of  a  mental  model 
is  now  estahii.shecl  in  psychology  and  psycholingmstics.  It  plays  a  central  role  in  most 
comprehension  theories  (for  example,  the  situation  model  of  (  Van  Dijk  &  Kintseh,  1983) 
and  the  referential  representation  of  (Just  &  Carpenter,  1987)).  It  continues  to  receive 
empirical  support  from  a  range  of  work  in  psycholinguistics  and  reasoning  tBower  & 
Morrow,  1990;  Johnson-Laird  &  Byrne,  1991;  Polk,  1992). 


jitf « .n.  •  Semantic  representation 

Comprehension  must  also  produce  a  refeience-indeperident  semantic  representation  that 
corresponds  lo  sense  or  intension.  Below  we  consider  the  functional  and  empirical  evidence 
for  such  a  representation. 

There  aie  two  kinds  of  functional  requiiernents  for  an  independent  semantic  represen¬ 
tation.  First,  uncovering  the  semantics  of  a  sentence  is  a  necessary  step  to  producing  the 
referential  repre,sentalion.  The  referents  of  expi  ession.s  such  as  the  most  famous  professor 
at  CiVllJ  in  sentence  (8b)  depends  on  the  underlying  semantics  of  the  expression,  not  its 
particular  surface  form.  Furthermore,  there  must  be  some  capability  of  creating  an  initial 
representation  of  a  referent  when  it  is  introduced  for  the  first  time.  This  capability  is  ef¬ 
fectively  a  process  of  producing  a  reference-  independent  representaiion  (because  it  cannot 
depend  on  first  retrieving  the  referent). 

The  !;econd  fiinctiorial  requirement  tor  a  semantic  representation  is  exemplified  by 
Fregean  examples  such  as  (9),  Such  examples  establish  that  tlie  sense  of  an  expression 
is  sometimes  independently  needed  in  order  lO  under.siaiid  the  expre,ssiou,  Oue-asiaphora 
providfc.i  another  good  example  of  this  requirement.  Consider  (11),  adapted  from  (.Allen, 
198  7); 

(11)  (a.)  Book  a  .seat  tor  ('ynthia  on  tlie  Ppm  flight  to  („)rian(io. 

d')  Book  one  for  Katy,  too. 

f'lcur’y  ('ut'  iii  (  !  Ifo  does  not  refer  to  the  seat  boetked  for  Katy,  but  rather  a  seat  dial  i.s 
idem i tie  1,1  lix'  cu.ribinmg  part  ol  the desci iptivin ol  the  .seat  in  (  1  la)  wi.di  pad  ot  the  lirseriplion 
m  i  1  !b;  There  must  be  at  least  a  icmpoiary  leiercnec-Huiepcudenl  meniory  of  die  Insl 
'■ernem;:.'  si  is  to  lie  interpreted  eorreetlv  iii  tfie  second  sentein.i. 

I  hen’  1:,  sii.si,'  Sam;.' i’nip'irieai  evidence  tor  an  ludcj.iendeiii  si  iiUsut u  pu'seiiiai  son  .Mam 
■y  ,i(  ihnson  I  aird  i  ! dh.'  i  perlonncd  '‘Xpci  iiuerus  tfiai  lesinJ  memo!  \  lor  \  i.  i  iialt',  [>i,’se!iled 
sjSiiial  iavoi'ls  1  J  hf  .spe/or/  ly  n>  !hi-  IrU  oj  Ptv  kr.iff.  I  hr  plow  i.v  it '  ih.r  >  o/i a  o!  ; hr  ktihr., 
nr.  r  1  aed  '.vci'e  tvio  kinds  of  ilcsi.  i ipUoiis:  <:!<  !(’> desenjiiiiiiis  wete  eoiisislem  e  ilh 
o!)  1  't'  sm  iC  pa,  Oio.i  lai  la  Vi  uu  _  and  oiorirnnoiou'  vti.‘se!'ipih;'ns  v,  l  1 1.  i,  tin  si  stem  h  I  \s'o  dudi  net 
S ,  vtnd s.  Maj'i  am,!  .lonn'-ou  - 1  .a net  assuiucii  tiiCi  subieeis  atte mpii  lo  i  'I'sti  in  i  a  i > lenl.i!  model 
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ctf  the  described  layout,  and  tiiat  two  mental  niodel.s  are  more  difficult  to  construct  and  retain 
than  one.  The  results  were  consistent  with  this  assumption.  Memory  for  the  layout  described 
by  the  determinate  descriptions  was  better  than  for  tlie  indeterminate  descriptions.  What  is 
relevant  to  tlie  referential/semantic  distinction,  however,  is  that  memory  for  the  verbatim 
form  of  the  description  was  better  for  the  indeterminate  de.scriptions.  This  suggests  that 
the  subjects  retained  a  superficial  semantic  representation  (which  is  naturally  closer  to  the 
surface  form  of  the  input)  when  the  indeterminacy  of  the  input  made  building  a  mental 
model  too  difhcult.  As  Mani  and  Johnson- Laird  point  out,  it  is  difficult  to  make  .sense 
of  such  a  cross-over  effect  without  appeal  to  two  different  kinds  of  representations.  A 
similar  effect  was  found  in  an  analysis  of  recognition  memory  for  a.  programmer's  manual 
(Schmalhofer  &  Glavanov,  1986). 

The  assumption  that  comprehension  produces  a  semantic  representation  is  widely 
adopted  in  AI  and  psychology.  For  example,  it  appears  as  the  logical  form  of  Allen 
(1987),  the  propositions  of  Johnson-Laird  (1983),  the  semantic  representation  of  Just  & 
Carpenter  (1987),  the  case  frames  of  (),  and  the  propositional  texthase  of  'Van  Dijk  & 
Kintsch  (1983)\  All  these  representations  have  in  common  the  central  property  that  they 
encode  the  sense  of  linguistic  expressions  in  some  intermediate  representation  prior  to 
computing  reference. 

2.1.3  Syntactic  representation 

In  this  section  we  consider  the  evidence  for  a  syntactic  structure  produced  as  an  mtermediate 
product  of  comprehension.  The  evidence  will  be  .strictly  functional,  since  there  are  still 
no  empirical  results  universally  accepted  as  indicating  the  existence  of  such  a  structure. 
The  claim  is  not  simply  that  syntax  is  required  to  comprehend  language;  that  much  is 
certainly  universally  accepted  and  almost  tautuiogicai.  Rather,  the  i.ssue  is  whether  or  not 
a  separate  syntactic  representation  is  computed  and  at  least  temportinl)  maintained  during 
cornfavhension. 

Makir/y  syntactic  disermmialions  is  nece  sarv  to  amvc  at  the  eoireet  meaning.  Hach 
aspect  of  svnta.x,  such  as  num  rei  .icieerncnt,  [loteiui.illv  elimm.ites  some  .semantic  anibi- 
L’uilv.  I'o  m.ike  these  disci  linmations,  . omiMchensuui  must  iii.imtaui  some  local  syntia  tic 
state  the  jirocess  c.innol  |usi  be  svntactically  driven  ar-d  leave  behind  noiisvniar  tic  lepre 
senlaiions.  i  wo  simple  exaniidcs  ill  illuslraU  the  point: 

(  1  7)  'toil  .ue  going  to  the  store 

I  M  !  I(  liin  s.i'.’v  him  Mi  ilu’  rMirioi 

I  li.i'.  I-  .i\  I  I  li  .111;'  /  's’/.s'.'/i;/ !( I  n-lci  ii.>  lliis  i I  ic(in.  ‘.!'iil,ii  i' m ,  .i  iici' t hr  ici  ii,  i ums I  i  ii  it:  f ti  i  t  iK 

«  .i\  If.  ( 'W  ., !  M  'h  r  -  \  I  1  n  uiir  ihrr  ;  [..t' .  a  i .,  .!  ,  I  .iirrn:  milr|iriii!rnl  im  iri.tl  ih.i!  .  .in  Is-  ir.i  ■,  1  e  i  rii.  Ir  .nc. 
!r '.  r  I  'll  1 1 'I 'I  ‘I"-,  r  III  ,(1 1 1  'll  i  r  ;■  .  >  iii'.l  .V  (  .ii  jK  iilr.c  I  ‘IS  .Mi  i  nhci  s  i  r'..r  t  \  r  llir  ti’i  i  n  I"  U'lri  i  ni ! Ii  >  I  h.‘  !t  t:  I  1 1| 

I  I'l'i  r  ■.rnt.iLi,  111  I'j  1, 11  1,1  I  rli'i  i.'ir.  ■  i  ■■■.■  iliiii.  ui  i  I.. ■Inn  i  ni  i  .iml.  I  Us  O  K .ill in  !  inn  i  i  k  i  li  i  ■.  >  -  nil  umi  mi  ,  1  c  i  i  i 
.  "Hi  I  line  n  ■  i  rl  r  i  ti '  ill"  n  ’ruin:,.  nl,  c  hu  ii  pini  r  .  I  hr  C  u  in.  "n  Hir  '  "iik'ii!  i  !  f  hr  i  '•[  n  r -.r  ni ,  il  i.  n , 

I  ,:i;;ri  ! Ii.ni  i r.  f in  i.i 
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In  (12),  the  syntactic  number  of  the  subject  of  the  sentence  (in  this  case,  you)  must  be 
represented  so  that  it  can  be  checked  against  the  number  of  the  verb  when  it  arrives.  If  the 
number  feature  is  not  independently  represented,  then  every  po.ssible  lexical  subject/verb 
pair  that  agrees  (e.g,  you/are)  must  be  stored,  and  the  check  performed  by  lookup.  That 
the  feature  is  not  purely  semantic  is  apparent  from  the  number  associated  with  the  English 
second  person  pronoun  you:  the  syntactic  number  is  plural  whether  the  pronoun  refers  to 
an  individual  or  a  set  (*John,  you  is  going  to  the  store.)  In  (13),  the  syntactic  structure 
of  the  initial  part  of  the  clause  (John  in  subject  position)  must  be  represented  so  that  the 
proper  reference  for  him  can  be  computed.  In  this  case,  him  cannot  refer  to  Jim,  otherwise 
the  reflexive  form  (himself)  must  be  used.  The  constraints  on  the  distribution  of  reflexives 
is  a  function  of  syntactic,  not  semantic  structure  (e.g.,  (Chomsky,  1965;  Chomsky,  1973)). 

Despite  these  functional  considerations,  some  scientists  in  both  AI  and  psychology  have 
argued  that  a  syntactic  representation  is  not  needed.  In  AI,  this  claim  is  best  represented  by 
the  work  of  Schank  and  colleagues  (Schank  &  Riesbeck,  1981).  They  constructed  a  number 
of  systems  that  parsed  directly  into  a  semantic  repre.sentation  (conceptual  dependency) 
without  an  intermediate  syntactic  structure.  In  psychology,  Johnson-Laird  (1983)  proposed 
that  human  comprehension  is  largely  syntactically  driven,  out  does  not  create  syntactic 
structure. 

However,  no  system  built  on  such  principles  has  manageo  to  achieve  anywhere  near 
the  st’uctural  coverage  of  systems  that  do  use  syntactic  representations.  The  syntax-lean 
systems  do,  of  course,  embody  some  syntactic  knowledge,  but  it  primarily  consists  of 
ordering  constraints  that  do  not  require  explicit  encodings  to  apply  (Schank  &  Riesbeck, 
1981 ).  Some  recent  work  that  has  emerged  from  the  tradition  of  the  Schank  conceptual 
parsers  does  in  fact  handle  more  complex  syntax,  but,  not  surprisingly,  the.se  .systems  do 
make  use  of  explicit  intermediate  syntactic  structures  (Cardie  &  Lehnert,  1991).  Thus, 
both  f  unctional  considerations  and  actual  .system  building  practice  provide  evidence  for  the 
necessity  of  explicit  syntaciic  encodings  ol  some  kind. 


2.1.4  Summary 

I’afile  summan/cs  the  tliiee  representations  and  tlie  evidence  for  them.  .Such  a  biiet 
eli.ii aeleri/alit'n  absirai  is  away  liom  an  enornioiis  number  ol  isMica  ior  example,  the 
entire  l.eld  ol  syntax  iii  linginsiics  but  it  serves  to  establish  the  basic  r  haraetei  ol  the 
out  jin  1  of  r  iMiirii  ehensioii  and  |)i  ovidi-  the  iieec'ssai  \  loiindal  loii  toi  di'^cnssing  tiu'  processing 
jiiicnoineua  aiul  liie  miniel  (nesenied  m  Chapici  V 

2.2  liiiniediiicv  and  the  dnic  course  ot  eonifirehensioii 

W’e  scun  lo  c.'mjnehend  speivii  on  .s  vviird  h’S  woid  Imsin,  .is  qiiu  l.K  as  he.n  il  \N  c 
.il  I  V  en  nine  i.ijnd  r.ites  .'-It!  umd-.  I'ei  ininiile  eliisl  .V  ( ',n  pnilfi ,  ic'S/i  !  he 
jiiicnomciioKn.'v  h.is  ninch  ot'ieelivi.  einjnrica!  sin  poii  t  .iijicnlei  .ind  lusi  c.ill  rtns  i.ijiitl 
mnnimi'  il  pi'oi -.m'.:  !in>iiidi,ic\  ot  oneni'riaiu'n  svii'.iUii,  sem.nilic  ,hul  r  c  In  cn!  i.il 
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Table  2. 1 ;  The  products  of  comprehension. 


Type 

Nature  of  Rep’n 

Evidence 

Referential 

Organized  around  referents  of  the 
text,  may  not  retain  intensional  dis¬ 
tinctions.  Form  IS  a  mode!  which 
represents  one  particulai-  situation 
consistent  with  text.  Aspects  of 
model  correspond  directly  to  aspects 
of  situation. 

Functionally  required.  Linguistic 
memory  does  not  always  reliably 
distinguish  expressions  that  actually 
occurred  from  referentially  or  infer- 
entially  equivalent  expressions. 

Semantic 

Represents  sense  of  expressions  (in¬ 
dependent  of  reference) 

Functionally  required.  Under  cer¬ 
tain  conditions  the  semantic  repre¬ 
sentation  is  retained  rather  than  the 
referential  representation. 

Syntactic 

Explicit  encoding  of  intermediate 
syntactic  structure. 

Functionally  required.  Practice  in 
building  working  NLP  systems  re¬ 
veals  this  requirement. 

processing  follow  immediately  on  the  heels  of  each  incoming  word,  and  are  rapidly  com¬ 
pleted  in  most  cases.  This  section  reviews  some  of  the  evidence  for  the  immediacy  of 
producing  the  three  representations  identified  in  §2. 1 . 

2.2.1  Immediacy  <}f  syntactic  parsing 

Syntactic  structure  is  computed  incrementally — an  incoming  word  is  integrated  immediately 
into  a  partial  syntactic  structure.  The  speech  shadowing  experiments  of  Marslen-Wil.son 
(1974,  1975)  provide  striking  evidence  for  the  immediacy  of  synt.uctic  processing.  The 
subjects’  task  is  to  repeat  back  speech  as  soon  as  they  hear  ii.  The  most  practiced  subjects 
shadowed  at  latencies  ol  .450  rns  or  le.ss.  When  subjects  i  >ade  errors  by  adding  or  changing 
words,  the  errors  were  consistent  with  the  preceding  .syntactic  context  well  over  SO'.f  of  the 
lime.  I'urtherinore.  this  con.sistericy  with  syntax  was  jui.t  as  likely  to  occur  at  latencies  of 
250  ms  (the  most  rapid  shadowers)  as  tl  was  at  b(K)  KKK)  ms.  These  experiments  indicate 
that  syntactic  [irocessing  of  a  word  (/ccurs  within  at  least  a  lew  liuiuired  inillisecoiids  of 
lieanng  die  word,  odiei  vvise  the  svntac'dc  context  woiiitl  not  be  aviulable  loi  generating  the 
next  shadowed  word  so  ijuickly. 

lire  Imlk  of  ecideiR.  foi'  svntaclic  nnm.ahacv  comes  fiom  tiacking  e\c  movements 
dni'iig  reading.  A  number  of  stnd'.'s  have  s.iown  Ih.il  eye  fixations  an.'  of  longer  duialion 
on  diose  i.ats  of  :.i  si'iitence  tliat  an'  sviitaciicallv  (litficult  oi  alloma!i.H!'^  le.::  ((  .iijicnlcr 
A  [.lanei  ian,  !  98 1 ,  I  ta/iei  A  Ravnei,  |98.’))  i  he.se  lesiilts  suggesl  dial  ‘aiti)ects  are 
a'.Iemplmg,  to  synt,.K  (k  .ili\  [sirse  m.itenal  as  .soon  .is  it  is  fixateif  In  i.ici  neailv  .dl  ol  die 
vast  ‘,el  of  ex(KMmen!s  exploring  sUnctntal  ..uii[ugm(-\  n  solution  i  foe  4 )  [uoxidc'  evufc'nce 
loi  svnl.u'iu  mmieiliacx 


2.2.  Imiucdiacy  and  (he  (itnc  coiir.'ic  of  cornprehem^ion 
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Limits  to  syntactic  immediacy? 

While  the  general  claim  is  uncontroversial,  there  is  still  some  debate  over  whether  all 
syntactic  information  becomes  available  on  a  word-by-word  basis.  If  there  is  evidence 
that  some  aspects  of  syntax  are  delayed  in  their  application  relative  to  others,  then  that 
may  provide  evidence  for  independent  modules  within  the  grammatical  knowledge  base 
(Fodor,  1988)'^  It  would  also  weaken  the  general  claim  for  syntactic  immediacy.  The  range 
of  possible  hypotheses  about  delayed  application  of  syntax  is  as  rich  as  modern  syntactic 
theory.  Below  we  will  consider  just  the  most  important  and  well-investigated  of  these 
claims:  that  verb  argument  structure  (or  subcategorization  information)  is  not  immediately 
available  to  guide  the  initial  parse. 

In  a  self-paced  reading  study,  Mitchell  (1987)  presented  subjects  with  material  such  as: 

(14)  (a)  After  the  audience  had  applauded  the  actors  sat  down  for  a  drink. 

(b)  After  the  audience  had  departed  the  actors  sat  down  for  a  drink. 

Mitchell  manipulated  the  transitivity  of  the  initial  verb  {applauded/departed)  to  see  what 
affect  this  had  on  parsing  the  immediately  following  NP  {the  actors).  Mitchell  found 
that  subjects  took  the  immediately  following  NP  as  a  direct  object  whether  the  verb  was 
transitive  (14a)  or  obligatorily  intransitive  ( 14b) 

However,  truly  obligatorily  intransiti’  r;  verbs  are  hard  to  come  by,  as  example  (14b) 
from  Mitchell’s  materia!  demonstrates:  dep'arted  is  not  in  fact  obligatorily  intransitive: 

(15)  The  actors  departed  >he  stage. 

The  inadequacy  of  the  materia!  therefore  makes  the  results  suspect;  this  problem  has  plagued 
other  studies  purporting  to  show  the  .same  effect  (Ferreira  &  Henderson,  1990). 

On  the  other  hand,  there  is  positive  evidence  that  subcategoiT^ation  information  is  used 
iminedialely  in  parsing.  As  Pritchett  ( 1992)  points  out,  the  garden  path  effect  in  (  I  t)  can 
be  avoided  by  using  a  properly  intransitive  verb: 

(  I'))  While  Maiv  slept  a  sock  fell  on  the  lli'or 

.Such  a  conliast  is  ililticult  to  cx.f)lain  li  ^ubcategori/ation  inlormation  is  delayed  ni  parsing. 

raiieniiaiis  and  ( 'aihain  and  cvtilean.nes  t  I9g0)  ha.ve  jims  aieei  a  gre,!!  da‘.d  ol  empnieal 
MipjHn!  toi'  die  nuniedi.ile  use  ol  lexical  mtoiinaluni  in  [XiiMiig  riiex'  sUiiiied  lilh.u  gap 
seiilenccs 

(I/)  lai  fhe  disiiiel  allornev  tound  out  winch  wilncss  tlu.-  icpoilei  .oked  : 
anxii'iisls  atsoiii  , 

do  1  he  dislne'.  .iiniiiic',  h.xind  iinl  wfni.ii  thun.h  ihe  rcrinriei  asked 

anxionsK  .inoin  , 
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(c)  The  physical  therapist  wasn’t  sure  which  bed  the  orderly  hurried  rapidly 
toward  y. 

Subjects  produced  longer  reading  times  at  potential  gap  sites  (marked  with  u)  of  tran¬ 
sitive  verbs  (asked)  when  the  fillers  were  semantically  anomalous  (17b),  compared  to 
semantically  plausible  cases  (17a).  But  the  long  times  could  be  eliminated  by  using  verbs 
with  an  intransitive  preference  (  i7c).  In  those  cases,  subjects  apparently  did  not  attempt  to 
associate  the  filler  with  the  verb,  indicating  immediate  use  of  verb  argument  structure. 


2.2.2  Immediacy  of  semantic  interpretation 

Sentence  meaning  is  computed  on  a  word-by-word  basis — an  incoming  word  is  integrated 
immediately  into  a  partial  semantic  representation.  The  speech  shadowing  experiments  that 
support  syntactic  immediacy  also  provide  evidence  for  immediacy  of  semantic  interpre¬ 
tation.  Just  as  subjects’  errors  were  generally  syntactically  consistent  with  the  preceding 
context,  they  were  semantically  consistent  as  well.  The  experiment  reported  in  (Marslen- 
Wilsun,  1975)  factors  out  the  contribution  of  semantics  to  the  shadowing  errors,  clearly 
demonstrating  that  the  effects  are  not  purely  ;yntactic. 

Important  evidence  also  comes  from  eye  movement  studies.  F'or  example.  Carpentei  & 
Just  ( 1983)  found  that  semantic  context  and  meaning  frequency  affect  the  time  that  readers 
fixate  on  ;i  polysernous  word.  Readers  spent  less  tun<'  on  words  when  the  high  frequency 
meaning  was  consi.stcnt  with  tiic  context,  indicating  that  readers  are  attempting  to  integrate 
the  incoming  word  into  the  meaning  of  the  sentence  as  it  is  being  read.  The  cross-modal 
primnig  studies  of  lexical  icccs.''.  afso  dcmon.strate  rapid  semantic  interpretation.  Priming 
effects  due  to  the  contextually  ifiappropriate  sen.se  of  a  poiysenious  word  disappear  after  a 
few  hundred  milliseconils  (e.g.,  (Swinney,  1979)). 

The  gap-filling  studies  of  Tarienhaus  &  Carlson  (I98d)  cited  above  provide  further 
cvidcritc  for  semantic  immediacy,  since  the  cliecl  uf  an  implausibje  lilier  shows  up  inline 
diatciv.  This  rcstdl  was  replicated  using  evimt  related  hram  potential^  (Taiicnhaus  el  al., 
1990),  '  he  relevant  Inidii'g  is  that  the  pallein  ol  hram  activity  ass  iciatcil  with  semantic 
aiidinalie'-  oecnis  ahoni  dOO  ills  atlei  encountering  :hc  .iiioniakius  iiart  ot  the  scntciivc. 


Idinits  }()  .seisiaiPii*  iiiiiiunJiacv? 

PsvcliohiiL’msts  aigumg  lor  sv  iilai'tic  iinnliiiarilv  li.ere  generated  main  results  winch  uiigdit 
t'e  uilei [11  ek'd  to  t.i\oi  a  model  m  winch  semanlic  mlerprciatioii  lags  signdu  antis'  i  at  least 
,1  ss  okI  Ol  mou' )  belli  nil  sv  lUacl  le  pi  occssmg  i  sec  tiic  studies  leU'i  i  ci  1  u  >  in  (jJ.  3 .  I  h>\s  c\  ei , 
llicse  e\|H',  imcnls  deal  exclusiseis  w  illi  ihe  application  o!  eeriain  kinds  ol  k  luns  ledge  !o 
lesiS-if  s\  (itae!ic  .milngnils  I  ht'ie  lias  been  nc  iliici.!  esuk  iue  shoss  nig  l  i.ii  sernaniii 
miei  pi  ei.ii  ion  e,  not  tia.ppv  nmg  iimiiediatels  Ihe  onl\  issue  i.nsed  b\  itie'.e  siiujies  is 
beihri  sein.iiii ic  i nloi  rn.ii loi  is  used  immedMielv  to  guide  lie,'  s  \  ni.iei  le  p.n  ^olg  1  he  ' o 
issues  ,i.v  li>;’ h  alls  iiuiepeiide ill 
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2.2.3  Immediacy  of  referentiai  processing 

C  mstructing  a  mental  model  requires  building  representations  of  new  entities  in  the  dis¬ 
course,  establishing  relations  among  them,  and  identifying  existing  individuals  when  they 
are  referred  to  (reference  resolution). 

The  referent  of  an  anaphor  (noun  phrase,  proper  noun,  or  pronoun)  is  computed  im¬ 
mediately  upon  encountering  the  anaphor.  Eye  movement  studies  by  Carpenter  &  Just 
(1977)  provide  evidence  for  immediacy  of  pronoun  resolution.  The  interesting  effect  was 
the  pattern  of  regressions;  subjects  often  moved  their  eyes  from  the  pronoun  itself  to  the 
antecedent  in  the  preceding  text.  Other  compelling  evidence  comes  from  the  cross-modal 
experiments  of  Tyler  &  Marslen-Wilson  (1982).  Subjects  heard  short  paragraphs  ending 
with  a  sentence  fragment  (such  as  He  ran  towards  . . .).  The  task  was  to  name  the  visually 
presented  continuation  probe,  which  was  a  pronoun  (either  him  or  her).  The  probe  was  al¬ 
ways  syntactically  correct,  but  depending  on  the  referent  of  him  or  her,  the  probe  was  either 
appropriate  or  inappropriate  with  respect  to  the  preceding  context.  There  was  a  naming 
latency  advantage  for  the  contextually  consi.stent  probe.s,  indicating  a  rapid  completion  of 
the  resolution  process. 

Dell,  McKoon,  and  Ratcliff  (1983)  pre.sent  evidence  for  immediacy  of  noun  pha.se 
resolution.  They  gave  subjects  texts  to  read  like  the  following: 

( 1 8)  The  burglar  surveyed  the  garage  set  back  from  the  street.  Several  bottles  of 
milk  were  piled  at  the  cur  b.  The  banker  and  her  husband  were  on  vacation. 
The  criminal  slipped  away  from  ihe  streetlainp. 

At  a  2.80  ms  oflset  Irom  Ihe  relevant  noun  phrase  (crimnnil),  subjecls  were  presented  a  word 
for  a  recognition  task.  Words  related  to  the  releieni  i.f  the  nounphia.se  were  named  faster 
than  words  that  were  luri.  l-or  example,  yarage  wouKi  be  pi  iinerl  280  ms  after  encountering 
criminal.  I  he  wools  were  only  related  by  the  reialions  esiablishod  in  the  text  itself,  as 
opposeii  to  general  knowledge,  hui  thermore,  the  relerring  expression  was  not  the  same  for 
the  twi)  lUHin  pliiases  (<  riminal  vs,  hu/.\;ho  I,  t‘»  ensim:  that  the  etlecl  was  due  io  iiderenlial 
pioi  essing  and  not  sonu'thmg  moie  su|'erf!i.'ia! 


l  imits  to  referential  iiiimediaev 

Whde  the  e\  ideiiee  eleaib'  supporls  imiiK'dial!;'  initiation  o'  leieieiuv  leseaiition,  the  datr. 
eorieernnig  ttu.‘  <  iimidciian  oi  die  ['>hk'c-s  is  inoie  eoinplex  (.S.edoui  A  t  iairod,  0*89)  I  oi 
!>i  th  ['loiioiiiis  ,iiid  noun  I’hrase  aiuiplMis,  tiiere  api'eai  to  be  ease-,  \s  iu‘ie  he  n. solution 
pi  'et’',s  Is  no!  i.'oini'ik‘ti.'u  until  wel I  ai  lei  llie  aii.iph or  i--  i iiil  lal  1  \  eiii  on niei  ei I  i  (  e.i  penk'i  A 
liki.  !  9  ,  / ,  I  111  1 1  \  ,X'  K.iViiei,  1 99(1  ( ,i(,-rii(,-  kp  Koon  A;  isaieiili.  |9o'’j  In  si  lue  e.ise-.,  (he 
Ninuiiiie  ol  die  le\l  ilsell  mak(.'s  it  im[iosM!'le  i.n  eonci  il\  hleiilil'  (he  lete  eiu  when  (lie 
.i!),i['hoi  I-,  i  neounteied  l>iil  otiiei  eases  .ijspeai  to  l>e  R'I.Jei.i  lo  (he  | u  oee-,s! ne  U'i|ii)[ed  lo 
eo-nipiile  die  leteieiil 

!  i  I  eene  1 . ! I .  i  r  i i  - 1  eiu  e  i  esoliii  ie>n  and  ii  leiilal  moiiv'l  eon-.i 1 1 let  ioi  i  i >  ;\  i  ei  | ii  1 1  e  an  .ii  i 'it i  ,ii  \ 
.aiionni  oi  inieKiiee  llieie  is  eknirlv  some  limit  lo  die  I'loeessnie  ilial  e.ui  liap'peii  ni 
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real-time  comprehension,  and  this  must  be  under  deliberate  control  to  some  extent.  These 
limits  have  been  explored  in  a  variety  of  experimental  paradigms.  The  results  of  the  Mani  & 
Johnson  L.aird  (1982)  study  discus.sed  above  suggest  that  a  mental  model  may  not  always 
be  constructed  or  retained  if  the  text  is  difficult.  Swinney  &  Osterhout  (1990)  presents 
cross-modal  priming  evidence  for  a  distinction  between  perceptual  (automatic,  on-line)  vs. 
cognitive  (deliberate,  post-processing)  inferences.  McKoon  &  Ratcliff  (1992)  also  provide 
evidence  fora  minimal  amount  of  automatic  inferencing  during  comprehension.  In  general, 
the  content  of  the  referential  representation  may  depend  on  a  number  of  variables,  such 
as  the  amount  of  time  available  for  comprehension,  prior  knowledge,  attention,  goals  in 
reading,  superficial  features  of  the  text,  and  .so  forth  (Just  &  Carpenter.  1987;  Oakhill, 
Garnham,  &  Vonk,  1989;  Schmalhofer  &  Glavanov,  1986;  Simon  &  Hayes,  1979).  The 
control  of  inferences  during  comprehension  is  still  an  open  re.search  issue  in  AI  as  well. 


The  time  course  of  processing 

Studies  of  eye  fixations  durings  skilled  reading  indicate  that,  on  average,  each  word  is 
processed  in  about  250  ms  (Just  &  Carpenter,  1987).  However,  the  amount  of  time  spent 
on  each  word  can  vary  greatly,  from  as  little  as  50  ms  to  1000  ms  or  more.  The  time  spent 
IS  a  function  of  many  features  of  the  text,  such  as  word  frequency,  syntactic  complexity, 
familiarity  with  content,  and  ambiguity  (e.g.,  (Just  S:  Carpenter,  1987;  C'arpenter  &  Just, 
1983;  MacDonaldetal.,  1993). 

2. .2. 4  Tht'orie.s  of  iiiinu  diacy 

.Although  imnietliacN  is  a  central  tenet  in  a  number  of  unportanl  ci'mpiehensam  theories, 
tliere  ;ue  verv  few  conipntational  theories  tliai  actiiaih  model  the  time  course  of Gompre 
heiiMon,  I'iirs  i'‘qnn>..''  dc\ c !«>piiii;,  a  conu>ie‘iCii,ac>ii  liieoix  \sillun  some  computational 
.nclnlci'lnic  and  gix  mg  .(  lenqioral  miei  prelalion  to  the  processing  pnnnti\  es  o!  the  aii'In 
ti.at  lire 

ivl  API  !■;  (  Hull. uU, III  el  .if,  I'AS.’,  .Iiisi  G  ( '.II  pcmlei,  l‘hS.'’l  is  ilii'  lirsi  exampk'  of  a 
luncIioiialH  complete  model  If. it  .ieeounts  loi  tlu;  lime  e>)iuse  ot  eonqiiehension  m  .iii\ 
si'’mlieani  \\.i\  td  \l>fK  is  desidoiHsi  iii  the  e M's  pividnelioii  s\slem,  v\[iKh  oper.des  viii 
eoiitniuoiis  evs'les  ol  m.ilcli  .iiid  .leto.ilion  piop.ig.ilion  I  lubade.tu  el  .il  show  that  In 
niiei  fiu-lme  the  luimher  ol  processing  c'\eles  that  ( Wl’.s  I, ikes  pei  woui  .is  ,i  me.isuii'  ot 
le.idiii;.'  lime  pel  wold.  Id  API  S’  Is  .ihle  to  pio\ide  a  r'ood  .K'conni  ot  the  le.idmg  times  ot 
!mi  I  i.ii  i  sui'|i.w  Is  1  In,- 1  e  .ill'  I  vs  o  iiMiiiu's  s  i|  tlu  nun  lei  th.il  c  onti  ibrUe  to  the  eood  lit  1  list 
ilia  Ic  \  h  on  III  n  I  \|  'I  IS  son  Or  Ik  Ic'd  so  that  the  mil  i.ii  .ml  i  \  at  loi  i  f '\  a  Is  ol  wool  senses  is 
.1  I  ink  lion  oi  i  he  1 1  IS  |i!Ci  u  \  otih.ii  w  oi  d  smlse.  so  ih.il  low  lieqnciic'v  suisesieqiiin  mole 
lVcIc  imii-  to  lioosi  .1,  1 1  \  .11  :oii  lo  d.ushi'lil  ii'\('ls  .'scs  ond  smi,e  Id  \dlf  i'mhodi'-s  i':e- 
!  sis  le  pi  1 1  le  1  [lie  o I  m  ii  iied  i.ic  \  ol  mtei  pu-i.il  ion,  it  s[H‘iu.is  longc'i  on  w  oh!  ,  I  li.il  i  n  ii  i  usli.ile !  \ 

1 1  m  eci  sig n  I  i  Ik  .ml  .n  i  niunt s  ol  -  \  ni.ici n  seni.ml ii  oi  i  e.n'i  enl  i.ii  pn ea''.,si ne 
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2.3  Structural  ambiguity  resolution 

Natural  language  ambiguities  may  be  classified  along  two  independent  dimensions:  content 
and  scope.  Content  identifies  a  particular  aspect  of  some  representational  level — for  exam¬ 
ple,  lexical  semantics  or  syntactic  attachment.  Scope  refers  to  the  span  of  input  over  which 
the  ambiguity  persists.  A  global  ambiguity  typically  refers  to  an  ambiguity  that  cannot 
be  resolved  by  taking  into  account  the  entire  sentence.  A  local  ambiguity  is  a  temporary 
ambiguity  that  w'ill  be  resolved  at  some  later  point  in  the  sentence.  For  example.  (19) 
exhibits  a  global  structural  ambiguity  at  with  (underlined).  There  is  a  choice  at  the  syntactic 
level  v/hich  cannot  be  resolved  by  the  end  of  the  sentence:  the  prepositional  phrase  may  be 
attached  to  cop  or  saw. 

( 19)  The  man  saw  the  cop  with  the  binoculars. 

(20)  exhibits  a  local  lexical  ambiguity  at  can  (auxiliary  vs.  main  verb  reading)  which  is 
resolved  by  the  end  of  the  sentence  (in  fact  the  next  word). 

(20)  These  factories  can  tuna  very  efficiently. 


Lexical  ambiguities  such  as  (20)  which  are  syntactic  in  nature  effectively  give  rise  to 
structural  ambiguities.  Not  all  lexical  ambiguities  are  structural;  ambiguities  such  as  (21) 
are  purely  semantic: 

(21 )  The  old  man's  glasses  were  filled  with  siierry. 

Holh  interpretations  ol  glasses  (drinking  vs,  (>ptical)  yield  pieciseiy  the  same  syntactic 
structure. 

The  l  emaiiider  o!  this  section  reviews  .'■ome  of  the  plienomena  sun  oundmg  the  l  esolution 
of  local  structural  ambiguity.  Lexical  amhigmtv  vvill  hr-  considered  only  to  the  extent  that 
It  g.n  es  rise  to  slriKluial  aiiibiguitv,  as  in  (20t.  I  he  ceiiir.il  tlieoieiir  al  tjuestions  are:  What 
knowledge  souict.‘s  are  brouglu  to  be.u  in  n-soh  irig  the  ambiguities,  and  how  and  v\  he!i  are 
die^-e  souiei's  .ippluaf* 


2.J.I  Striutiiral  [)rtdVt nuos 

(  ei  tain  ainlugn  It  les  ha  \  e  jiu-ti-iied  inter[ii  et.ii  n  uis  dial  eai  i  In  ehai  aelei  i  ,-'ed  in  puu'l  \  stine 
lui.il  leiins,  I  Ol  e  x.irnpie,  eoiiNidei  ( ) 

lb. id  s.iid  tli.it  lon\  ilew  to  .Atl.inta  \esleid.i\ 

I  lus  e  o  a  pi  e  I  ei  'MU  e  to  a'-soi  ante  \  a  s/e/./iO  w  1 1  fi  th  \\  i  .ill  lei  I  h,n  Siiu!  i  lu  uig  li  In  it  1 1  i  iiU  i  j  m  , 
i  Ml  ions  .lie  ei  1 1 1.1  IK  ei  .niimal  leal  and  j  ifneable  Kuiib.il  I  i  O'  '  '  i  .it  1 1  1 1  ml,.  ■,  1  h  e.  to  a  ,  '-n'  I  ei  (  lus 
O'l  hinn.iii  I'.iisiin.’  >.,iifed  luetn  uitwri 

i  ’i)  liMiiiin.il  intuil  s  v  m 'In  nal  I  \  aseoeMle  lollie  low  (■',[  luai 

'ei  iiMiial  nodi 
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s 


NP 

John 


V  NP  PP 


boughtthe  book 


P  NP 
tor  Susan 


S 


NP 

John 


VP 


V  NP 


bought 


NP  PP 
the  booky'^ 

P  NP 
for  Susan 


Minimal  Non-mininial 

Figure  2.  l ;  How  to  minimally  attach  a  PP.  The  minimal  attachment  introduces  tne  fewest  number 
of  new  nodes. 


Since  the  VP  node  headed  by  flew  is  iower  than  the  one  headed  by  said,  the  incoming 
material  is  attached  io  flew.  Two  other  important  preferences  proposed  in  the  literature  are 
the  Canonical  Senioid  Strategy  (Bever,  1970;  Fodor  ei  al.,  1974)  and  Minimal  Attachment 
(Fra/ier  &  Fodor,  1978); 

( 24)  Canonical Sentoid Strategy:  . . .  wheneverorie  encounters  a  surface  sequence 
NP  V  (NP).  a.s.suine  that  the.se  items  are.  respectively,  subject,  verb,  and 
olrjecl  or  a  deep  .sentoid. 

( 2.^ )  Minimal  Attachment:  Fach  lexical  hem  (or  other  node )  is  to  be  attached  into 
the  phiasc  marker  with  the  fewest  number  of  nontermmai  nodes  linking  it 
with  the  nodes  which  are  already  present. 

rite  (’anoriieal  Sentoid  Siraiegv  accounts  for  the  br/arie  interpretation  of  {2b)  (Pritchett, 

(2('i  .l.ijianese  ini-'h  iHailes  npCfiinese 

i  he  sequence  ./n/'i.'.nc.sc  /'//.^//  is  nile'qireled  as  subject  verb,  rathei  than  as  an  NP  I  he 
I'U-poMiional  phrase  .unf'ieiniN  in(2/)  iliiisti .ites  bow  Minini.il  .AtlachriHuit  c  oiks: 

. ■  I  .hiliii  tionehi  I.lie  f>oc>k  !.>i  .Siisai: 

1  leni,.-  I  i’lces  (lie  siiuciiiies  t,.u  tin  luci  jiossihle  .iit.ichinenls,  Miinnial  .Atl.ichnieiii 
si'lci  Is  (he  one  \\  I'h  the  tewei  i  .ules,  thus  jut  Jr  Ini;,',  the  ptelei  leti  .it'.icliiiient  ot  tor  SuMin 
!.)  liie  \  1’  iKsle. 

I.ibic  .'  '  lish.  ,1  iiuiiitHn  ot  Niiiu  tut.il  I'lt'lereiu'es  (h.il  h.oe  t'eeii  nleiitilied  in  ilie  hlei 
.iln.ie  lhe\  .ne  hs(ed  tune  lUiinaiiK  as  .lescnplioiis  nl  ph(.'nonK'n.i.,  Ailthuu  .ni\  inlinilu  .n 
ol  Juixni!'  ilicin  iii;s niei  u  ,i!  s|,.iliis  hi  s.’  i  '>  ue  will  coiisulei  soiiu'  tliesC  .e.  iheoietu  ,il 
Cv  'I  I  '.I  (  ‘h  1  s 


SiaictLinil  :uv.higuity  resolution 


TABLii  2.2:  Some  structural  parrsiug  prcferer.ces. 


Canonica,!  Sentoid  Strategy  Interpret  N-V-N  string  as  subject-verb-object.  .Fixampie: 
(Bever,  1970;  Fodor  et  al  .,  Japanese  ,>ush  bottles  up  Chinese. 

1974) 


Right  Association 
(Kimball,  197:-l) 


Attach  lo  rightmost  (lowest)  open  phrase.  Example:  John 
said  it  would  rahi  yesterday.  Yesterday  attaches  tc  rain. 


Early  Closure 
(Kimball,  1 973) 


Close  phrases  as  soon  as  possible.  Example;  They  knew 
the  y'.  I  was  in  the  closet.  The  S  node  is  closed  prematurely 
at  the  girl,  accounting  for  processing  difference  with  the 
unambiguons  They  knew  that  the  girl  was  in  the  closet. 


Minimal  Attachment  Attach  with  minima!  number  of  nodes.  Example:  John 

(riazier  &  Fodor,  )  978)  bought  the  dress  for  Susan.  For  Susan  attaches  to  bought. 


A-over-,A  Early  Closure  Given  two  phrases  in  the  same  category,  the  higher  closes 

(Church,  1 989)  only  wb^n  bofhare  eligible  for  Kimball  s  ly  Closure.  Ex¬ 

ample:  I  called  the  guy  w  .  <o  smasned  ni\  car  a  rotten  driver. 
Driver  may  attar ::  -o  coded  bet”>use  the  1  it  s  '•  remains  open. 


l.ate  Closure  Delay  closing  corsiitneuts;  prefer  t  .  aach  new  material  to 

(Frazier  &  Ray ner,  1982)  existing  nodes.  Example:  Since  Jay  jogs  a  trule  ste  ns  like 

a  short  distant-'’.  A  mi!.  -  attaches  to  jogs  initi  tily,  causing  a 
garden  padt 


Prefer  Arguments  I’refer  aigumen:  (c\>m;)!emeni)  attachmems  over  rion- 

( Abney,  1989)  argiimeiU  (adjunct)  atlai  iiments.  Example:  'I'he  man  e.v- 

pressed  interest  ’/i  the  Voivr>:  in  tin'  Vci  ah  aches  to  lnie>' 
est  as  an  ai;.,uiiievu ,  rather  tliaii  to  e.xjtresstil  as  a  1.x.;.uivl 

ad|uiK:(. 


ilo'.v  lohirsi  are  liic.se  prelcieuces'  lines  hiirn  n)  paiertig  aiways  opr’rak'  in  acroi'daiire 
vt  rih  some  i^yiit.iClu'  pu’leicnceis),  oi  (.:an  ihe;-.!  [ireleu  t'ce.s  be  chaiigcd  hr  scni„,.)lK:  oi 
[iiagmatic  laclois''  Diav  ing  in  p.iit  on  llic  h vpoiheses  ol  1  (.dra  (  '983 !  and  l  oistci  (  1979), 
mans  jisvchologists  tia\i‘.  finiposed  that  .sMitaciic  pioc-s  aiij.'  is  nuKlulai.  l  lie  r  ',uni  is  iti.il 
anfOM  iiuno  ssiiiaclk  nmiUile  is  ii.;-.piin‘.if'lc  loi  mi aetia  un;  iingui  du  iiijuit,  an.  ibis 
Sinn  ".111  ini.',  is  .k  con iplislied.  (  hiiiKtll  v.  at  Ic.i-.i )  w iih(,.u!  legaKl  !•:.»  itf  m  s  \ , a aci ;r  .  ni  e.rinai i<  ,n 
i  odi>i  I  i'Te,  I  .ilp,  ill!',  laltei  wh'cli  an  essential  U’.iiU'e  v'!  nasliilanls  re/e  . 

■'i. i/a  O'  I'fu  ill  "■  ii nil !( ‘f:  1. etdi."!  11  u‘  n  .odiihii  lU  s  ie\% ,  ti le  ii liei C’  ;  a  ly  i soie  i  ,  n  O'  1 1 iig  < m :  \s  tial 
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ambiguities,  because  these  principles  may  reflect  the  basic  nature  of  the  syntactic  processor. 

A  number  of  empiricail  studies  have  been  cariied  cut  in  support  of  the  modularity 
hypothesis.  Nearly  all  have  the  .same  basic  structure; 

«  Subjects  are  presented  with  materia!  that  contains  a  local  .structural  ambiguity. 

9  The  manipulated  variable  is  some  non.syntactic  context  prior  to  or  including  the  local 
ambiguity. 

e  Readings  times  are  recorded  in  the  disambiguating  region,  as  a  measure  of  compre¬ 
hension  difficulty. 

•  If  the  measure  of  difficulty  is  independent  of  the  nonsyntactic  manipulation,  this 
is  taken  to  support  a  purely  syntactic  ambiguity  resolution  strategy,  and  a  modular 
parsing  architecture  irnpervio;  to  context. 

For  example,  Ferreira  &  Clifton  (1986)  present  evidence  that  ambiguous  NP-V  strings  are 
interpreted  as  subject-verb  (in.  accord  with  Minimal  Attachment  and  Canor.i'  al  Sentoid) 
regardless  of  the  implausibility  of  the  reading.  Matenai  included  sentences  such  as; 

(28)  (a)  The  evidence  examined  by  the  iaw3'e',-  turned  out  to  be  unreliable. 

(b)  The  defendant  examined  by  the  lawyer  turned  out  to  be  unreliable. 

In  (28a),  the  inanimacy  of  evidence  makes  it  implausible  that  the  evidence  was  doing  the 
examining.  Yet,  increased  reading  times  (over  unambiguous  controls)  were  detected  in 
both  cases  in  the  disambiguating  region,  suggesting  that  subjects  incorrextly  interpreted  the 
first  verb  as  the  matrix  verb. 

Table  2A  summarizes  some  of  the  c  .periments  supporting  syntactic  modularity  gen¬ 
erally  (and  Minimal  Attachment  specifically)'’  The  structure  of  the  table  is  as  follows; 
Amhipuity  refers  to  the  kind  of  structural  imbiguity  studied;  Manipulalion  i  cfers  to  the  kind 
of  nonsyntaclic  information  that  the  ambiguity  resolution  was  found  to  be  insulated  from; 
Method  refers  to  the  mode  of  prc.scntarion  and.  beiiavioral  measure. 

Tlic  results  of  these  studie;;  should  be  interpreted  with  care  (Tancnhaiis  ct  al.,  1989;  Ty  let; 

1989),  (icncrali/ing  over  tlic.se  mater ia.l-. . for  esiu.upk-,  lo  concluvlc  tfosu  the  I'cnrira  & 

Clifton  (1986)  study  lh;il  ammacy  never  atbxis  ou-itne  amOignity  resoluUofi  would  he 
ill  ailv'scd,  especially  in  hglil  of  Uic  iistemctive  results  dicru.'.se;!  u;  §2.;''  ,  *.  Nevertheless, 
'he  stiK:  rs  do  sh<c,v  at  lea.vt  that  tliere  are  some  combinatioiiS  of  subjeti.s,  anibi  ’Mity  types, 
and  iionsyii'aclic  content  Jiat  go'C  r-is'.*  to  modular  cffeciS, 


rV  c(!Li|ni-  ol  vvi  II  y  ini'vi:  ivl'iiinKi;  Ari  u.iiiiu-in  MuCres  ,uo  ^■ot  isii  pKlts!  titn;'  Hie  t  'K'.  R.iy.ie) 

(  I'lSri  sUiu'.  co..ibli,sti(si  a  p'rlviprei;  t  it  Miiiini.il  .\n.ieliiiii-n',  t'.ui  div!  .a  i!  .rx jilieilly  e.raniuH:  snv  -u-iisyn 
larlu  i  ieluc.  t  in.  i<.ivi;ci  fi  ai  (  ; OK,!}  sfiuiv  iieiiirnilatix' (:tai!'.it)nUv,  bi!t 'lie  ^rTna.i! ic.d!  .■  ar,  iin;  uiiv,’ 
n  'I  I  lOii.iiu.ii  euaif  ,il  f'-i  ilu-  Kx  al  m'  liisi  nihii  mat  uni  i,  nuld  n-u  a.i\  e  been  hr.)i,l,:'iii  S'  I;:,;!  i  m  iint,' 

in  ,1111  liieiin  ii'xee;!l  inK-  inv  i.' msi  liHikiiiif.ail  i 

!'>'■  i’ljai  fl  ,.ii.  (IS”'  'i  r ; 'Ct  iineiii  wlMi.h  .■.a-  iiii.- lidco  i'l  !t -.1  ioi  v.ni((.‘i  rai  iii.iUr  ■  ni  x  i.! 

aihli’i'i'iUi''.  n,  '’  'I  1,1.  li.iik'd  I'l  'tre  inc  irvUTial  dui  ruir  aiOiuiii',  .■■eiii  l  ,iii  'uvli  (Ik' 

■  i  ln' ■  I  I'iicslli,.!  ii.Si.i,  ■  :  l  iinm  ' ,  h,iny  n. Iiiv  p'  ‘Oi  nmrt.na  r 'sririii'.  tin  ihr  i  .  :  i ' ;  u ,  n . k  :  n:  '  j ’I 

1*  ’  ,u  hr  mill  ■  i;  yrli  nif!)  I  1  r  [)■  iMr  '  !n  I  hr  \,.i!ir  p.ij'CI  I  ih a 
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Tablk  2.3:  Some  studies  demonstrating  modularity  effects. 


Ambic.uity  type 

iVlANiPUL.ATION 

Method 

Vulin ■■  v'erb/reduced  relative 

i  Animacy  of  subject 

Context  (#  of  referents) 

.CT  (F^errciia  &,  Clifton,  19S6; 
Just  &  Carfienter,  1992) 

El  (Feireira  &  Clifton,  1986); 
SPP  (Femeira  &  Cl  fton.  1986) 

1 

Context  (discourse  focus) 

ET  (Rayner  et  al.,  ]  992) 

Cornpiemerdrelative  clause  j 

Context  (#  of  referents) 

SPW  (Mitchell  et  al,  1992) 

PP  attachment  (arg/adjuncS)  j 

Context  (#  of  referents) 
Context  (discourse  focus) 

ET  (Feixeira  &  Clifton,  1986) 

ET  (Rayner  et  al.,  1992) 

ET  =  eye  tracking,  SPW=  self-paced  word-by-word,  SPP  =  self-paced  phrase-by  phrase 


How  often  will  a  parser  guided  by  purely  .structural  preference  chDose  the  correct  inter¬ 
pretation?  Though  most  preferences  have  been  motivated  by  a  narrow  range  of  linguistic 
exantp'es,  there  have  been  some  recent  analyses  of  natural  corpora  (Gibson  &  Peadniutter, 
1993;Hindle&Rooth,  1 99 1 ;  Hobbs  &  Bear,  1990;  Whiuemore& Ferrara,  1990).  The  study 
by  ‘2'’hitternore  &  Ferrara  ( 1 990)  tests  the  predictions  of  se vei  al  different  attachment  heuris¬ 
tic;.  an  725  sentences  containing  prepositional  phrase  attachment  ambiguities.  No  single 
heuristic:  tuey  considered  works  better  than  55%  of  the  time.  What  works  best  is  essentially 
a  combination  of  lexically-specific  information — (^specially  argument  structu  'e — atid  right 
association.  This  basic  pattern  also  held  in  other  analyses  (Gibson  &  Pearlrnutler,  1993; 
HtndJe  &  Rooth,  1991;  Hobbs  &  Bear,  199G)^ 

Apart  from  the  inadecjuacy  of  any  single  strategy,  perhaps  the  most  striking  result  of 
the  Whiternore  el  al.  study  is  the  dismal  performance  of  Minimal  Att.achment;  it  correctly 
pi  edicts  attachment  o.ily  36%  of  the  time.  These  result.s  do  not  in  and  of  themselves  falsify 
any  structural  preference  as  a  theory  of  on-line  {larsing,  because  the  data  is  b.:iscd  only 
on  tiic  fill'. I  prcfcired  structure.  But  it  docs  .make  abundantly  dear  the  need  for  a  theory 
of  nuinairsis  to  cavmpleincit  the.se  preferences,  othervvi.se  they  risk  grossly  vivetpredicf  ing 
garden  path  eifects  .see  §2.4), 

1-3.1  .t.ej'ical  ftrefereocfS 

The  giceilic  lexica!  content  of  an  ambiguous  sentence  can  sometimes  make  a  (.iiflercnce  in 
a:-  prdeircd  interpietation.  Ford.  Presnan,  and  Kapian  (1982)  show  that  dijicreni  vsorris 
may  oiffei  m  die  argumc.iis  they  prefer,  fhe  crucial  motivating  csaiujiics  arc  rmnimally 
cvtiUi  asling  :'.cntCfices  ;:::,’di  :.i>  (29): 

'’  i  i.e  i.t.'C,  ..III  );-'C  (U  JVCri'hititMils  aeiaiarccij  )ci  hy  ,i  It'XKVil  asoKKUiiin  '.(ialrcv  was 

aj,vi.!r  la  ii.e  yji'' .?sri  as  1’ •.irliccuei  '  0>  f  h  stijiiv  Stirn  ia  itic  cLuia;r  :  laci!' ..  .'"ha  cuiiuic;  aU)  ifnia;;,! 

hii ::  ii j  '  1 1  i,:r,  lor?  I  I  Tijc  (.'aOi';-)'  siuiiics  ■o.a.iscd  cArius!  '.'cl V  I  .r;  N-V\' t'  an',;,'ii  li.r'i'i  !r: w  h:  ic  !  ta. ci 

■■.lah  .  iiiiK.c)  ajubiaaiii;’-  ?! :  )i\  nri;  vUm;  ul  dk'  W'hjtcrii.nc  cl  dS,  '.uiih  ihi:'  liitia.ii". 

a.'.  I  ' 'ai '.i'C-u  ...hiK  I'.!,' tpicai  a-'ic  !’c,>i;iiii.ii.'er '.'a.iv  iim'.:)  cr .iicj'le  takci  h  siaihc  f!oiw,r: 


^r-iaafsau 
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(29)  (a)  The  wiarnan  wanted  the  dress  on  that  rack. 

i,b)  "rhe  woman  oositioned  the  dress  on  that  rack. 


In  (29a),  the  preferred  interpretation  as.sociates  on  that  rack  w/jih  dress,  but  in  t29b),  on 
that  rack  is  associated  with  po.<;ition.  Since  the  alternative  strtictures  for  both  seutence.s  are 
putatively  ident'cai  and  on;y  differ  in  the  specific  verb  used,  FonJ  et  ai.  arg  ie  that  this  is 
evidence  for  a  purely  lexical  preference.  They  established  the  phenomenon  wr.  h  .an  informal 
survc}'  of  subject  intuitions  about  the  meaning  of  sentences  like  (29).  Some  of  the  re  .lUs 
of  grammaticaiity  judgment  tasks  reported  in  (Kurtzman,  1985)  provide  further  evidcr ,  ^  of 
lexical  effects.  The  PP  attachment  studie,s  mentioned  above  also  indicate  effects  of  lexica! 
prelcrences,  but  not  noces.sarily  the  specific  kind  piroposed  by  Ford  et  a!.:  rucst  of  these 
(-.fiects  may  be  due  to  simply  to  a  preference  for  arguments  o  ver  adjuncts. 

Ford  et  a),,  note  tlTt?  conte.Kt  can  apparently  override  lexical  preferences: 


130)  (a)  When  he  arrived  at  our  doorstep,  I  could  see  that  Joe  carried  a  package 

for  Susan. 


(h)  Whenever  she  got  tired,  Joe  carried  a  paclcage  for  Susan, 


to'cn  in  the  absenct.  of  biasing  contC-Xt.  lexica!  prereren.;es  mu.st  not  be  ab.soiute,  bec,ai.ise 
as  Gibson  ( !  99 1 )  points  oul,  couiUerexatnpies  can  be  found: 


(3 1 )  T  wanted  the  diess  for  Susan. 

Here,  the  PP  dovt.s  seem  to  attach  to  the  verb,  counter  to  the  preiVc  nce  used  to  explain  (  29a), 


2.3.3  Sematitic  asid  coetextual  effects 


I..ocalIy  ambiguous  material  is  sometimes  interpreted  a.s  a  function  of  the  -oca!  .sornantio 
content  of  the  sentence  itself,  or  the  prior  discourse  context.  .Pudi  eff'ec  s  are  called 
haeractive  since  they  demonstrate  the  isiieraction  of  ;iiuitip:e  Knowledge  sciurces  lO  the 
coinpreliensi.on  process  (^4ars'ien- Wilson,  i975).  For  example,  T,:ine';!fK.iu,s  c*  ai. 
and  Just  &  Carpenter  ( 1992)  present  evitience  suggesting  that,  coiila'ry  so  what  Pcii  cii  a  & 
Chiton  ( 1986)  loiiiid,  ,sonie  subji’Cl.s  do  in  fact  make  'Ujiid  use  of  aniuuu;y  information  in 
resolving  the  local  an'luguiiy  oi'  sentences  life  (28).  Caun  &  StcedmaH  (  j9H5  j  produeod 
simiiar  te:su.Us  on  a  rapid  graminaticali'y  pidgnien!  task,  show'ing  that  ihe  piausihilify  of  the 
grattiiuatica!  reading  affected  the  chanc.:''  of  sentences  vsith  reduced  relative  ambivs  iues 
being  caiied  gracin:  .Uicai. 

Situc'cie!  a>’;rhg'.iide;.  may  also  be  resolved  by  appeal  to  the  r  nnent  context,  i  elei  & 
iVl  aslcn  W‘Sso:i  i  19/7)  present  sti'cmg  evidence  of  the  rapt  .1  clfcci,  coniexl  can  iiave  on 
synSiiictic  pioeessui;:'  isee  jIms  ( Mairsien-  •iVil'O!'  <Sc  Ivlc..,  iCH/))  biugects  lieatd  sentence 
tr.igi.u'co-  like  (.C),  r'Ciio'g  with  aoibiguous  sirm.r.  ini  ’u Tceh 


is2)  ia)  11  voa  .v.'T  >oo  neat  the  u'nwav  lanauso  sdasrs 
lb)  b  voa’ve  trauea;  as  *  piioi.,  iamoio  tuofito. 


2.3.  Slruclural  ambiguity  resolution 
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At  the  offset  of  the  final  word  in  the  ambiguous  plirase  (_n/an«-),  a  probe  word  was  visually 
piesented.  TTie  word  was  a  verb  w'hich  was  a  continuation  of  the  sentence,  d'he  subject’s 
task  was  to  name  the  verb.  The  contextually  appropriate  continuation  is  are  for  (32a)  and 
is  foi'  (32b).  Appropriate  continuations  had  a  naming  latency  idvantage  over  inappropriate 
continuations,  indicating  that  the  context  had  a  rapid  effect  on  the  initial  analysis  of  the 
ambiguous  string.  Using  a  different  technique  (rapid  grammaticality  Judgment),  Crain  & 
Steedrnan  (1985)  show'ed  that  the  number  of  referents  establislied  in  the  conteid  can  affect 
the  perceived  grammaticality  of  a  locally  ambiguous  sentence. 

Demonstrations  of  contextual  or  semantic  effects  have  sometimes  been  criticized  for 
not  being  serrsitive  to  on-  line  attachment  choices  (Clifton  &  Ferreira,  1989).  The  critics 
emphasize  that  what  is  at  issue  is  not  the  final  interpretation  given  an  ambiguous  .string — no 
one  denies  that  nonsyntactic  information  can  ultimately  have  a  tremendous  influence — but 
the  initial  attachment  choice.s  miade  by  the  parser.  While  some  experiment.s  may  be  subject 
to  this  criticism,  it  should  be  clear  from  the  v^xampies  above  that  there  are  demonstrations 
of  interactive  effects  that  use  techniques  sensitive  to  the  immediate  interpretation  of  am¬ 
biguities  (e.g,  the  pruning  techniques  of  Marslen -Wilson,  and  the  eye  movement  studies 
of  Carperuer  and  Just).  Even  the  .studies  which  coulu  arguably  be  insensitive  to  on-line 
processes  (tho.se  employing  rapid  grammaticalitv  judgments)  produce  results  that  are  prob¬ 
lematic  to  explain  with  a  strongly  modular  theory  (Altmann,  1988;  Steedrnan  &  Altrnarin. 
1989). 

Interactive  effects  have  now  been  demonstrated  across  range  of  5;yn.tactic  ambiguity 
types,  knowledge  sources,  and  experinetita!  echmqne,-..  Table  2.4  summarizes  some  of 
these  studies  in  the  same  format  iscd  in  'Fable  2.3  to  present  the  rnodulai  exprerirrients. 

2.3.4  Limttcc'iS  paraHelism 

Most  of  the  discus.sion  above  iuts  implicitly  assumed  that  a  single  interpretation  is  selected  at 
ambiguous  point.s.  However,  it  is  possible  that  multiple  interpretations  might  be  computed 
ai'uil  maintained  in  parallel,  and  this  is  an  import:;tnt  theoretical  and  empirical  issrre. 

.Ill  fact  ,  a  number  o)  early  studies  fmit  showed  clear  eifect.s  of  ambiguity  were  taken  to 
supprort  a  li,mited  middp'.r  meanings  mofiel,  in  which  i.nullii>!c  meanings  of  an  arnliiguoi.s 
pii'a.sc  ,ir‘e  crtmpnted  and  maintained  imti!  context  select',  one,  or  miti!  the  cunent  clause 
.s  dosed  idiariv  &  Clark.  !977).  FTi  e.xampie,  MacKay  (i9(>6)  pirsenird  snlijceis  wnlli 
amicnce  traements  to  complete,  and  discovered  thas  rimbiguous  fragments  took  longer 
fr)  ccuiifdcic  ttiau  unambiguous  !rag.rnciils,  (..ackner  ts,  (iarrett  (lh72r  deinonsirated  an 
amlcguiiv  el  Icct  in  an  interesting  task  where  subjects  were  requir  ed  to  sku  d()!irasc  a  seatcucc 
heard  in  one  cat,  while  ignor'ing  a  sentence  hearxi  rn  the  other  isii,  iftt-  t!iu-u>ielaii('ii  or  an 
anihigiicnis  scnteiicc  sodi  as  i'33)  could  he  irifh'f-nced  bv  ar.  uiiaUcndcd  iirasing  sentence 
suchasc-f): 

i.i  'i  llii.  'i[»v  pul  out  tile  iord:  us  our  srgjeii  l.o  attark. 
i-'  o  iar  lliv  '.pv  extiuguih'std  trie  torcij  ;r.  ;i5r  windov . 

'hr  !  he  spV  Ui  Spl thv'  tOl'i.  n  lii  tllC  SVril'.’e:V. 
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Table  2.4;  Some  studies  demonstrating  interactive  effects. 


Ambiguity  type  Manipulation  Method 

Main  verb/reduced  relative 

Animacy  of  subject  ET  (.fust  &  Carpenter,  1992; 

Tanenhaus  et  al.,  1989) 

Temporal  context  ET  (Truesvrel!  &  Tanenhaus, 

1992) 

Plausibility  SPW  (Pearlrnutter  &  MacDon¬ 

ald,  1992),  RGJ  (Crain  &  Steed- 
man.  1985) 

PP  attachment  (NP  or  VP) 

Context  (#  of  referents)  WS  (Altmann,  1987);  ET  (Britt 

et  ah,  1992),  SPW  (Britt  et  al., 
1992);  SPP  (Britt  et  ah,  1992) 

Semantic  content  of  prior  VP  SPW  (Taraban  &  McClelland, 

1988) 

Coirtent  of  VP  object  SPW  (Taraban  &  McClelland, 

1990) 

Complernent/relativc  clause  1 

1 

Context  (#  of  referents)  RGJ  (Crain  &  Steedman.  1985); 

WS  (Altm-ann,  1987);  ET  (Alt- 
mftnn  et  ah,  1992) 

Adiectival/geiuud 

Content  of  initial  phra.se  LNC  (Tyler  &  Marslen-Wilson, 

1977;  Marslen-Wilson  &  Tyler. 
1987) 

SuhjecLobject 

Syntactic  context  RGJ  (Warner  &  Glass,  1987) 

Semantic  context  RGJ  (Warner  &  Glass,  1987) 

ET  -  eye  tracking,  L.NC  =;  lexical  naming  of  continuations,  RGJ  =  rapid  grarnmaticality  judgment, 
SPF’=  self-paced  phrase- by-phrase,  SPW  =  self-paced  word-by-word,  WS  =  whole  sentence 


In  general,  the  closer  to  the  ambiguity  a  technique  probes,  the  more  likely  effects  due  to 
ambiguity  will  appear  (Foss  &  .lenkins,  1973).  Some  .studies  demonstrated  that  atiibiguity 
effects  disappear  altogether  following  clau.se  boundaries,  supporting  the  the(,)ry  that  mul- 
liple  interpretations  are  maintained  within  clauses,  and  all  but  one  is  di.scarded  at  clause 
boundaries  (Bever,  (Jarrell,  ('(:  Hurtig,  1973). 

Most  of  these  earlier  stuiiies  were  not  locused  exclnsivcly  on  structural  ambiguity, 
as  the  material  m  (33)  iliuslratcs.  f  urthermore,  the  studies  often  used  non  trivial  post 
comprehension  tasks  (e.g.,  sentence  completion  or  paraphrasing)  that  prevented  direct 
.issessmeiit  ol  tire  time  course  and  iialure  of  ambiguity  resolution.  In  contrast,  mucii 
snbsetjiieni  work  specifically  addiessitig  structural  ainliiguity  with  oii  line  tccliniLjucs  has 
yielcied  evulence  consistent  with  the  immediate  selection  of  a  single  structure  (see  (j2,3.1 
aiirl  ( la  a/iei,  I  ‘)<S  / 1 ). 

Reseaichcis  h.i\e  ivccutlv  turned  back  to  trymg  to  find  diid.  t  cx'iUencc  for  slnu  tiiral 
fiaialleiisni  Kurt/rmui  (  Idhs;  demonstrated  clearly  the  coiiqricxity  ol  tiic  fiiu  riomena.  I 's 
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ing  r  apid  grammaticality  judgments  of  sentence  fragments,  Kurtzman  showed  that  multiple 
structural  interpretations  may  be  available  at  disambiguating  points,  but  that  their  avail¬ 
ability  depends  on  a  variety  of  syntactic  and  possibly  pragmatic  factors.  Using  syntactic 
and  semantic  priming  techniques,  Gorrell  (1987),  Hickok  (1993),  and  Nicol  &  Pickering 
(1993)  demonstrated  that  both  structural  interpretations  of  some  ambiguous  strings  are 
available  after  the  ambiguous  point.  MacDonald,  Just,  and  Carpenter  (1992)  showed  thcit 
subjects  spend  longer  reading  ambiguous  regions  than  unambiguous  controls,  suggesting 
that  multiple  interpretations  are  being  computed. 

The  overwhelmingly  puzzling  aspect  of  the  Gorrell,  Hickok,  and  Nicol  and  Pickering 
studies  is  that  they  provide  evidence  for  the  maintenance  of  the  unpreferred  interpretation  of 
a  sentence  that  causes  a  severe  garden  path.  Consider  the  following  example  from  Hickok; 

(35)  The  psychologist  told  the  wife  that  the  man  bumped  that  her  car  was  stolen. 

Hickok  (and  Nicol  and  Pickering)  found  evidence  that  people  compute  the  relative  clause 
interpretation  for  this  sentence,  even  though  it  is  precisely  the  unavailability  of  this  interpre  ¬ 
tation  at  some  level  that  causes  the  garden  path.  (The  garden  path  status  of  this  structure  was 
clearly  established  with  a  separate  grammaticality  judgment  experiment  (Hickok,  1993),  in 
which  sentences  like  (35)  were  judged  ungrammatical  about  99%  of  the  time.)  Although 
all  the  researchers  present  some  possible  explanations  for  this  result,  no  wholly  satisfactory 
and  coherent  account  has  been  developed. 

2»3.5  Theories  of  ambiguity  resolution 

Strategy-based  comprehension 

Preferences  such  as  Canonical  Sentoid  and  Right  Association,  discussed  in  §2.3.1,  were 
originally  developed  as  part  of  a  theoret9:a!  framework  that  might  best  be  termed  strategy- 
based  comprehension.  Strategie.s  provided  the  Hist  theoretical  apparatus  that  separated  the 
performance  system  from  the  competence  grammar.  As  it  became  clear  that  the  transfoi  - 
iTiatioris  of  generative  grammars  in  the  1960s  did  not  correspond  to  pri'cesses  of  parsing 
and  interpretation  (e.g.,  (Podor  &  Garrett,  1967)),  p.sychologists  turned  to  the  perceptual 
mapping  strategie.s  to  carry  the  full  burden  of  explaining  comprehension.  Under  this  view, 
comprehension  consists  primaiily  ol  a  collection  of  interacting  syntactic  atid  semantic 
hcurislics  tluU  map  surface  form  to  some  underlying  structure  (Bever,  1970;  Kimball,  1973; 
Clark  A  Claik,  1977,  Van  Dijk  &  Kinisch,  1983),  I  he.se  heuristics  were  .issuiiied  to  rcliect 
basic  cognitive  or  development  constraints  (Bever,  !97()). 

'I’hougfi  (lie  strategy  ba.scd  approach  has  been  extremely  mlluential,  it  has  tlnee  funda 
uiemai  [iiohlenis: 

1  l>cs])iic  die  geneial  togmtive  oi  develo[)inentiil  lOoii vation  lor  the  .ijjpio.ah,  ifu- 
spec  itu  sirategies  aic  ad  hoc  m  nature. 
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2.  A  strategy-based  theory  t  ^'ten  does  not  make  clear  predictions  and  is  therefore  difficult 
to  test.  The  reason  is  that  each  strategy  is  not  an  absolute  mapping  rule,  but  a  construct 
operating  in  concert  with  a  heterogeneous  collection  of  strategies.  The  interactions 
among  these  strategies  were  never  well-specified,  and  no  complete  computational 
models  were  constructed.  Furthermore,  as  Pritchett  (1992)  points  out,  the  separation 
of  the  strategies  from  a  competence  grammar  results  in  a  curious  situation:  the 
strategies  do  not  depend  on  any  particular  grammatical  theorj/,  but  sometimes  depend 
on  specific  grammatical  rules. 

3.  When  the  structural  .strategies  are  formulated  preci.seiy  and  used  individually  to 
predict  human  preferences,  a  number  of  empirical  problems  arise  in  accounting  for 
glooal  as  well  as  on-line  preferences.  This  was  demonstrated  by  the  Whittemore  & 
Ferrara  (1990)  study  discussed  earlier,  as  well  as  the  interactive  studies  (Table  2.4). 
For  detailed  empirical  critiques  of  several  of  the  proposed  strategies,  see  (Gibson, 
1991)  and  (Pritchett,  1992). 

The  Sausage  Machine  and  Minimal  Attachment 

The  concern  for  the  ad  hoc  nature  of  parsing  strategies  led  to  the  attempt  to  more  carefully 
derive  the  strategies  from  some  underlying  parsing  architecture.  The  best  known  example  is 
the  Sausage  Machine  (Frazier  &  Fodor,  1978).  The  Sausage  Machine  is  a  tw'o  stage  model. 
The  first  stage  is  the  preliminary  phrase  packager  (PPP,  the  Sausage  Machine  proper),  which 
operates  within  the  context  of  a  restricted  six-word  window  and  assigns  lexical  and  phrasal 
nodes.  The  second  stage,  the  Sentence  Structure  Supervisor  (SSS),  sweeps  along  behind 
the  PPP  .ad  structures  the  phrases  into  .sentence  nodes.  Right  A;  sociation  is  not  a  .stipulated 
preference  but  emerges  because  the  fixed  window  restricts  available  attachment  sites.  Any 
remaining  ambiguities  are  re.solved  by  the  principle  of  Minimal  Attachment  (25). 

The  Sausage  Machine  and  Minimal  Attachment  made  two  significant  theoretical  ad¬ 
vances  over  the  earlier  strategy-ba.se i  theories.  First,  they  are  more  clearly  motivated  by 
computational  considerations:  Right  Association  emerges  from  the  fixed  window,  which 
reduces  the  parsing  search  space,  and  Minimal  Attachment  is  formulated  to  keep)  the  short 
term  memory  load  to  a  minimum.  Second.  Minimal  Altachment  applies  to  a  wider  range 
of  cases  than  other  strategies,  which  tend  to  be  sjieciiic  ti'  jjaiticular  strir.'turcs. 

The  theoretical  and  empirical  problems  with  the  .Sausage  Machine  and  Minimal  Auad  - 
nient  are  well  known  (eg.,  (Wanner,  1980;  Abney.  1989:,  Gibson,  1991;  Pritchett,  1992). 
Briefly,  they  mcliide: 

!.  flic  inalnlily  ol  the  six  word  wiiulow  to  correctly  piredict  parsing  preferences  on  slu  rt 
scniences; 

2.  riic  inalulity  !o  aecoimi  foi  ihi' on  line  semantic  .uk!  pfiagniatii dfccts  idciiiihet  ni 

.V  1  he  gictrnmatie.ill'v'  sii.s[:)cet  assumption  iliat  ariiunction  to  .NP  uiiiiormlv  inliodu'.vs 
new  nodes  vs  tnic  adumeiion  to  VP  rloes  not; 
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4.  A  range  of  false  attachment  predictions  that  derive  partly  from  Minimal  Attachment's 
insensitivity  to  argument  structure;^ 

5.  The  vague  appeal  to  computational  justification — while  an  advance  over  earlier  theo¬ 
ries,  MA  was  still  developed  in  the  absence  of  any  precisely  articulated  assumptions 
about  computational  structures  or  processes,  and  in  the  absence  of  a  model  of  com¬ 
prehension  that  goes  beyond  parsing  (Altmann,  1988). 

Nevertheless,  Minimal  Attachment  and  the  Sausage  Machine  set  the  agenda  for  much  of 
the  sentence  processing  work  that  followed.  This  work  includes  attempts  to  characterize 
attachment  preferences  in  terms  of  alternative  parsing  architectures,  such  as  ATNs  (Wanner, 
1980). 

Generalized  Theta  Attachment 

Recently,  Pritchett  (1988,  1992)  has  advocated  a  return  to  strongly  grammatically-derived 
processing  models  of  the  kind  that  were  abandoned  shortly  after  their  conception  in  the 
1960s.  Pritchett’s  theory  of  how  parsing  proceeds  is  captured  in  the  following  statement: 

(36)  Generalized  Theta  Attachment  (GTA):  Every  principle  of  the  Syntax  at¬ 
tempts  to  be  maximally  satisfied  at  every  point  during  processing. 

“Every  principle  of  the  Syntax”  refers  to  the  principles  of  Government  and  Binding  theory 
(Chomsky,  1981 ),  To  illustrate  how  GTA  works,  consider  the  following  example: 

(37)  I  donated  the  gifts  to  the  church  . . . 

The  preferred  attachment  of  to  the  church  is  as  the  second  argument  of  donate,  rather  than  as 
a  modifier  of  gi/A.  GTA  predicts  this  because  the  argument  attachment  maximally  satisfies 
the  Theta  Criterion  (Chomsky,  1981),  which  slates,  roughly,  that  each  thematic  role  must 
be  assigned  to  exactly  one  argument  and  each  argument  must  bear  exactly  one  thematic 
role. 

G  l  A  accounts  for  many  of  the  same  effects  as  Canonical  Sentoid,  Minimal  Attat.hmt  nt, 
and  Freler  Arguments  but  elegantly  collap.ses  these  preferences  into  a  suiglc  prmci[>lc, 
without  many  of  the  linguistic  and  empirical  difficulties  of  Minimal  Attachment.  Though 
(fiTA  bears  a  family  rescinhlaiicc  to  theories  such  as  Minimal  Attachment,  il  differs  fr  om 
its  immediate  predecessors  m  two  important  ways,  f  irst,  the  theory  derives  its  empirical 
[iredictions  from  a  particular  syntactic  theory,  namely  CB.  Second,  the  theory  is:  not  moti- 
vaied  hy  any  extra  grammatical  assumptions  (apait  from  the  (flA  itself)  such  as  short  term 
memory  limitations  or  fixed  windows. 

A  piohk-rn  I'-I.'itcd  U)  iIr‘  i.imisiiitl  .issuni;  ill  oils  about  adjUiK  Iior. :  l)iil  s.-t  .Alaic'  (  1  US'.)  I  for  an  '.-xi  'laiia!  ion 
ul  wliv  moditviri!’  ihioa-  a^sll^llplul^s  icaves  .Miiiirua)  .Allaclimcfii  fnipiin.  aliv  lurrl 

^  itia  natne  the'  lii.loiiuai  li-vcIopiiuuK  of  die  dicoiy  (loiii  one  ■  oiiconu'd  [iinii.iiilv  vmiIi  dK-l.i 

inaitiun  to  oiu'  lhat  appt'.als  lo  ai!  synlaolM.  [aiiin.  ijilrs  ta]ua!lv  '  pitdn  !o  dunk  of  it  .is  (/oroA  ( >ti\  i/i  tinu'tU 
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There  are  two  kinds  of  phenomena  left  unaccounted  for  by  GTA.  GTA  does  not  predict 
Right  Association  effects.  However,  in  this  regard,  the  theory  is  no  worse  off  than  any  other, 
bccau.se  every  previous  theory  must  account  for  this  preference  by  simply  positing  it  as  an 
additional  principle'*.  The  most  serious  is.sue  for  GTA  is  that  (like  other  purely  syntactic 
theories)  it  cannot  account  for  the  on-line  semantic  and  contextual  effects  in  §2.3.3. 

Lexical  preferences 

Ford,  Bresnan,  and  Kaplan  (1982)  proposed  a  theory  of  ambiguity  resolution  motivated  by 
their  data  on  lexical  preferences  (§2.3.2).  The  lexical  theory  posits  that  each  lexical  form 
of  a  given  verb  has  an  associated  strength,  and  verbs  differ  in  the  strengths  of  their  different 
forms.  In  the  case  of  (38),  the  form  of  want  that  takes  a  single  NP  complement  is  strongest, 
while  pc  sitioned  prefers  a  double  complement. 

(38)  (a)  The  woman  wanted  the  dress  on  that  rack. 

(b)  The  woman  positioned  the  dress  on  that  rack. 

There  are  two  important  issues  in  considering  lexica!  preferences  as  a  theory  of  ambi¬ 
guity  resolution: 

1 .  How  do  lexical  preferences  affect  the  immediate,  on-line  attachment  decisions? 

2.  How  do  lexical  preferences  interact  with  other  knowledge  sources  to  arrive  at  the 
initial  or  final  interpretation? 

The  first  is  me  is  related  to  the  genera!  que.stion  of  how  lexical  information  affects  on-line 
parsing  (e.g.,  .see  the  discussion  of  subcategoriza  ion  in  §2.2.  i).  The  data  pre.sented  in 
(Ford  et  al  ,  1982)  does  not  address  this  issue  since  it  concerns  only  the  final  interpretations 
given  to  globally  ambiguous  sentences  (cf.  the  studies  summarized  in  Table  2.3).  As  for  the 
.second  issue,  the  authors'  themselves  note  th:it  context  can  override  lexical  preferences  (see 
i  30)).  Given  this  fact,  and  the  potential  empirical  problems  even  in  “neutral”  contexts  (3 1 ), 
it  is  unclear  just  tunv  important  lexical  preferences  (as  formulated  by  Ford  ct  al.)  are  in  the 
comprehension  process,  fhe  issue  might  be  clarified  if  lexical  preferences  could  provide 
an  cxplanat.oii  ior  garden  rrath  elfects,  hut  this  prove.',  quite  probieniaiie  (see  §2  4.4), 

Weakly  interactive  models 

Altmann.  vfraun  and  Steedman,  motivated  Iry  their  own  empirical  work  on  the  eflects  o' 
context  on  parsing,  have  proposed  a  fine  praincih  parallel,  weakly  ititenu  iive  sentenev 
jiroci'ssiiig  ao  iii’CCture  (( 'rain  A  Sreedrnari,  1985;  Altrnann  A  .str'cdrnan,  19.H.S;  .Sieedinaii 
A  Ailmann.  19K'ki  A  se|iarate  syniaciu-  processor  produces  synt.actn.'  '-iruciures  in  [larallel, 
on  a  word  i>v  v‘.S)r!l  basis.  I  he  pieleiir  ti  analysis  is  seiei:icd  on  die  oasis  ol  sciii.uitic  or 
rctciciitutl  approjii'iatcra'ss.  I  Ik.-  iii'xk'l  is  wetikly  mteoKa'i vc  because  it  inamiainsa  scpiaratc 

-  all  llial  rhe  Ma.  Innc  eoui.;  iin!  [xedu  I  "Itcels  !.ii  ‘.iiup  '.('nlciu  es 
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module  that  analyses  the  input  syntactically  without  regard  to  semantics  or  context.  It  is 
parallel  because,  unlike  the  Minimal  Attachment  model,  multiple  structures  are  produced 
in  parallel  ac  local  ambiguities.  It  is  fine-grained  because  partial  structures  are  produced 
and  ambiguities  resolved  on  a  word-by-word  basis. 

The  model  particularly  emphasizes  the  role  of  referential  context  in  resolving  ambigui¬ 
ties.  This  role  is  captured  in  two  principles; 

(39)  The  Principle  of  Referential  Support:  An  NP  analysis  which  is  referentially 
supported  will  be  favored  over  one  that  is  not. 

(40)  The  Principle  of  Parsimony:  A  reading  which  carries  fewer  unsupported 
presuppositions  will  be  favored  ove;'  one  that  carries  more. 

The  operation  of  (39)  can  be  illustrated  in  the  following  text; 

(41)  A  psychologist  was  counseling  two  women.  He  was  worried  about  one  of 
the  pair  but  not  about  the  other.  The  psychologist  told  the  woman  that. . . 


Referential  Support  will  prefer  attaching  the  incoming  clause  (signaled  by  that)  to  the  NP 
the.  woman  rather  than  as  a  complement  of  told,  because  the  simple  NP  analysis  of  the 
women  fails  to  uniquely  refer,  since  there  are  two  women  in  the  discourse  context. 

This  model  has  much  to  recommend  it,  since  it  begins  to  explain  how  context  can 
modulate  the  parsing  process,  without  leaving  these  effects  entirely  to  some  unexplicaied 
post-first-pass  processing.  The  Principle  of  Parsimony  also  explains  why  certain  apparently 
purely  structural  preferences  obtain  in  neutral  contexts  (e  g.,  the  preference  for  the  main 
verb  over  reduced  relative  reading).  What  is  not  altogether  clear  in  ‘his  approach,  however, 
is  why  any  modular  effects  of  the  kind  listed  in  Table  2.3  should  arise. 


Weighted-  e  vidence  models 

Weight/'d-evidence  models  releis  to  a  class  of  processing  tiicories  that  bring  to  bear  inuili 
pie  knowledge  sources — syntactic,  lexical,  semantic,  pragmatic  -  simultaneously  and  uiu 
lormly  on  each  aspect  ol  the  comprehension  process.  Iii  particular,  syntactic  parsing  and  the 
icsuiution  of  structuiai  ambiguity  arc  potcnliallv  under  the  inllucncc  of  all  these  sources 
Such  models  may  be  called  strongly  mtciaclivc,  or  coirstramt-satislactioii  models,  but 
I  have  used  the  term  weightcii  evidence  to  ernphasi/.c  a  eominon  leatiire.  I'he  supqiort 
tor  alternative  iiiterpretations  is  soine  tunclKm  ol  the  dillerent  sourees  ol  evitlcnee  lor  that 
inlerpivtation  I  tiese  sourees  inay  have  dillerent  weights  or  strengths,  i'rocesMjig  decisions 
enurrge  as  the  lesull  ot  a  conipciition  among  rcpicscntalioio  ol  diUcrciii  mterpretalions, 
bxiiiiijili:-,  ijiclude  the  Hi:Ai»;R  model  ('!'!iibad*-;;u,  Jnsi,  A  Caipentei,  ki  aiu  R  is 

h;,',ed  on  the  t'.Al’.S  ar''liitecviirc .  wha  ii  is  an  asiivatioii  based  [irodurUioii  svsleui  (.lusi  A 
( 'arptoiiei,  !  98  / )  I’lotiuciious  {lepiesentmg  dii lei eiit  knowledge  s( »im;es i  ducL  t  tU  !i  \  al mn 
to  leprescntalions  ol  die  iiiinit.  'I'he  .actissUion  oi  sirengtli  o  an  iiiteipietaiioii  is  i  (uiipiiteii 
In  Si  I  inn  ling  (he  aslis  .limn  ul  ail  the  piodui.  tr.  ms  siippoi  tiiig  lh„!t  iiilei  piclal  ion.  1  he  aiiioum 
ol  .iclnstfion  asso'Siatcd  with  each  produi  t;()n  i.s  in  linn  a  liiiK'lion  ol  die  slii.'iiglh  u;  tlu 
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particular  production,  and  the  activation  level  of  representations  satisfying  the  conditions 
of  the  production. 

The  connectionist  model  of  fSt.  John  &  McClelland,  1990)  implements  similar  ideas, 
though  the  network  does  not  have  distinguished  bits  of  structure  corresponding  to  pro¬ 
ductions.  The  St.  John  and  McClelland  model  also  differs  from  the  READER  model  in 
an  important  and  informative  way;  it  does  not  compute  a  separate  syntactic  structure  (cf. 
§2.1.3).  The  theory  thus  adopts  a  more  radical  form  of  nonmodularity.  This  is  an  indepen¬ 
dent  choice  which  is  not  inherent  to  weighted-evidence  or  strongly  interactive  theories. 

A  central  concern  for  these  models  as  theories  of  ambiguity  resolution  is  the  difficulty 
in  making  specific  predictions,  since  essentially  any  knowledge  source  may  be  posited  to 
affect  a  choice.  There  is  a  need  to  continue  the  development  of  a  knowledge  level  theory 
(along  the  lines  of  (39)  and  (40))  that  will  make  predictions  across  a  range  of  ambiguity 
situations  with  some  generality. 

Of  course,  another  problem  for  strongly  interactive  theories  is  explaining  why  modular 
effects  show  up  at  all.  The  Capacity-Constrained  READER  model  (CC  READER)  (Just  & 
Carpenter,  1992)  provides  one  interesting  solution  to  this  problem.  In  CC  READER,  modular 
effects  arise  because  of  limited  working  memory  cap  -city,  which  corresponds  to  activation 
in  the  model.  Activation  limits  may  prevent  all  the  relevant  knowledge  sources  from  being 
brought  to  bear.  Furthermore,  these  limits  are  hypothesized  to  differ  across  individuals, 
predicting  individual  differences  in  modular  effects  Although  which  knowledge  sources 
suffer  as  a  result  of  WM  limits  is  still  a  degree  of  freedom  in  the  model,  the  model  does 
show  in  principle  how  an  otherwise  strongly  interactive  theory  can  exhibit  modular  effects. 


Constraiaed  parallelism 

Parallel  models  of  ambiguity  resolution  do  not  ncccssariiv  select  one  structure  at  an  !im- 
biguous  point,  but  fAvnul  ll'c  explicit  nuiintenaiice  of  multiple  structures  dming  the  parse. 
The  challenge  in  such  a  framework  is  to  tnul  ways  loconsuam  (lie  parallcbsu!  -o  (hat  the 
uiiuihi'i  ol  interpretalious  d<H‘s  luU  grow  combinatoi i.diy,  arrl  :■<'  tj.  tt  fmur.ui  [scHoiinance 
is  modeled. 

file  limn  on  a,(,-ti'.  aliou  in  (he  ( '( '  Kl-.''..i)l  model  (.■ou'-ii  aius  us  [mi a' lei  s!  .s  ( .Mar' 

Dorialii,  .tiisl,  A;  i 'aipiaUv!,  if  fin-  lunii  is  icaciKal  aJuls'  i uauiiainijii'  nailiiplr 

st: uctiues,  ol  the  .sinu  luie.s  is  liiss aided  (o  tree  up  is.'soui-..i‘N.  i  is.:  m  alci  aci  'aars's  ivM 
.senianlie  and  prai'iuaiic  [nelcreiiees  fiv  [iiping  acnvaiion  to  the  more  p!e!eiu.-ii  ,su  ncuiic  .,  .w 
deseiibcti  abo\c.  .Again,  while  llie  basu.'  .sAuetuie  is  ui  place  'oli.,!iuiie  a  ranr'e  o!  aafhig'uri 
re.soluiioii  <.'l(ce|s.  one  of  (tie  ditfh  tilncs  wiit;  (iae  model  is  itial  tiieic  aie  maio'  degrees 
of  tna.aioiii  in  llie  sUategies  loi  ’.'.'oikuig  iiieniorv  maiiageniei't,  so  it  is  dillieiill  lo  make 
detailed  piediciieais 

.A.  senes  of  iiioie  sviitaelie.ilh  i)!  u'ljicd  theoi  les  lias  been  deseioped  In  Km  1/maii  (  I  osa  i,. 
(ioiiell  ||0S/),  ao  I  (  iiosoii  tpa.Ai  riie-.:-  models  laiik  paiallel  allei  nai  is  es  ptini.eii'v 
aieoniini’  to  siiiK  iui.i!  le  idiies,  .As  in  dieei'  Rt  .-M'IK  model,  Ims  [iiel'M it'd  sinu  iiue-,  a.i,- 
eoiitiiMioiisi pi  lined  (  i ii  'son  Ueo: loped  a  [>i et  ise  meli  u  for  assiginim  '  .  sm.  '  - 1  ,! i  u,  t n :  .il 
aliia iKil i\ es,  p,.uil\  denseri  In  lu  ;niiu  i|iles  of  (  dt  sviil.is,  A  siiii  ei;  ;  pin.ied  i!  'O 
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existing  structure  is  cneaper  by  some  constant  factor.  Empirically,  the  model  is  an  advance 
over  other  parallel  theories  of  ambiguity  resolution,  because  it  is  able  to  make  detailed 
predictions  and  accounts  for  a  range  of  structural  preferences.  The  Gorell  and  Gibson 
models  both  allow  for  pragmatic  and  semantic  effects  to  modulate  the  preferences,  but  this 
part  of  the  theory  is  not  worked  out  in  much  detail  (in  contrast  to  cc  reader). 

The  theories  above  demonstrate  that  parallel  models  are  capable  in  principle  of  exhibit¬ 
ing  modular  and  interactive  effects,  and  predicting  structural  preferences.  But  as  mentioned 
in  §2.3.4,  parallel  models  do  not  automatically  account  for  the  complex  phenomena  sur¬ 
rounding  parallelism  in  humans. 

Deterministic  parsers 

The  PARSIFAL  system  of  Marcus  (1980)  w'as  the  first  attempt  to  build  a  model  of  parsing 
that  was  strictly  deterministic.  Marcus  proposed  the  following  definition  of  a  deterministic 
parser; 

1 .  All  syntactic  substructures  created  by  the  parser  are  permanent. 

2.  All  syntactic  substructures  created  by  the  parser  for  a  given  input  must  be  output  as 
part  of  the  str  ucture  assigned  to  that  input. 

3.  The  internal  state  of  the  parsers  must  be  constrained  in  such  a  way  that  tetitporary 
syntactic  structures  are  not  encoded  within  the  state  of  the  machine  (no  simulated 
non -determinism). 

Strictly  deterministic  parsers  are  simpler  and  more  ellicieiit  than  iioudcteniiinistic  parsers 
since  they  disperrse  with  the  addrtional  mechanisms  ret|uired  to  maintain  parallel  stales  or 
[vriorm  backtracking. 

Marcus  noled  that  a  determirir.strc  parser  that  is  forced  to  make  clioices  immerlialely  at 
ambiguous  points  lalls  sliort  ot  the  apparent  human  capacitv  to  luiiidie  ambiguities,  overfue 
dicimg  gaideii  |>alli  etlecrs,  f-oi  tins  reason.  P.VK.sii  .Xi  nsi-s  lookahead  !o  del;i\  .illachmeii! 
decisions,  flic  look,rliead  is  supportiai  try  a  three  cedi  butler  dial  liokls  I'onMitiiiaus  av.  ar.iiig 
air.ii  timeiil  .\tlai  Iiiik'iiI  decisions  aie  made  b\  palfern  action  rules  sciisiIim'  In  ihe  luiuie 
s\  iil.iclK,  coiiIeM, 

Ihe  ii.d.il  I  \  e  si  iii|d  Kit  \  ;iiul  rd  ticieiir  v  ot  delei  null  otic  arch  itecin  res  makes  them  .111(10.1 1 
Ml'.'  c.Ui'h'IaU's  Irri  pscclioluiguisiic  i.ioileK  lloweM'i,  ,ss  a  model  ul  ambieuils  lesoluiioia 
MvS'll  iias  i\M>  laiih  serious  shottcoiiimgs  l  ust,  ihe  wsiemaiie  dida\  of  aUachnu.'iil 
lieci.aviiis  IS  iiK  onsistc'i!  with  the  ai  cuinulatei.1  es  ideiice  111  'mppoil  cl  s\  niacin  mimcih.!,  \ 

■'  I  I  .SCI  Olid,  ihc  sii.iier’ies  ( iidc'.l  ii'.ed  to  icsuhe  anibigiiilics  .lie  puieU  ■.n  iiI.u  Iu  .im! 
ilici.'lore  siibiec!  to  the  -.niie  cnlK  isiii  ol  all  s\ iiI.k  IK'  lesohilion  Itieoi  les  the  ni.ibilits  to 
iccoiiiii  ioi  the  mtei.iclixc  cllec's  liiH  uiiu  nted  III  T'  '  '  !n  piuieiple,  tio'.s  e\  ei  it  '>hoii!d 
I H  pi  1 .,  .ilde  to  ,'i  insl  I  Ik  I  deternimeaie  look.ihe.ul  1  .'ode  K  I  li.i  ,11  e  at  le.e.t  \ce,ik  K  mlei  .leu  \  e 

.S uiisi '‘.|iieMl  \S  oi  k  b\  M.ii cii'^  I't  .il  i  I  OS  '  I  led  io  .1  \  .iriai it  ot  delei  11  imi  si  u  p.ii  sei  known 
,e.  .e.'i's'.'.'ss,'/  I  ,  'rntm'in!  n:  p.nscis  (Wemhcig.  loop  (.loiiell.  fO'-i  O  \1iim!!,.il  i  oimnilmeni 
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(a)  (b) 

D(y,x).  D(y.w)  D(y,x],  D(y,w}< 

D(q.w).  D(q,p) 

Figure  2.2:  Dominance  relations  in  minimal  commitment  models. 

parsers  dispense  wirh  lookahead  and  instead  adopt  a  representation  of  syntactic  .structure 
known  as  D-theory\  which  permits  underspecification.  The  key  is  building  trees  with 
dominance  relations  that  do  not  encode  strict  dominance.  In  this  way,  the  parser  can 
minimally  commit  to  structure.  Figure  2.2  .shows  how  this  works.  The  representation  of 
the  tree  in  (a)  can  be  changed  to  a  repre.sentation  of  the  tree  in  (b)  by  simply  adding  a  new 
dominance  relation.  The  original  relation  D{y.  vv)  does  not  a.s.sert  that  y  directlv  dominates 
H',  so  any  number  of  fihrasal  nodes  can  be  added  later  between  y  and  w,  as  long  a.s  v  continues 
to  dominate  ir. 

Since  miimnal  commitment  theories  do  not  u.se  lookahead,  they  offer  the  possibility  of 
addressing  immediacy  of  mterpretatic'n.  Weinlierg  ( 1  Wl )  assumes  that  the  representations 
are,  m  fact,  mimediateiy  interpreted.  But  this  requires  determining  which  syntactic  relations 
.ictii.illv  hold  Beciuise  the  dominance  relations  do  not  explicitly  specify  which  relations 
hold,  additional  computation  must  be  |)erfornied  to  make  the  immediate  dominance  relations 
explicit rtiese  computations  are  purelv  svnt.ictic  in  natuie,  since  they  process  and  produce 
piirelv  syntactic  repi eseiitations.  Wh'*iher  this  process  is  considered  part  o^  the  parser 
ilselt,  or  a-.sigiied  lo  the  semantic  mterpietci  an  odd  partioning  ot  tuiiction  ueterniimsm 
IS  vmlated  since  the  -.li  iictmes  protlik  ed  tor  mtcrji'ctaiion  may  change  nonmonotonicallv 
till oiighoiit  the  p.iise  Ihus.  llie  mnmn.il  lommilmenl  morieis  stdi  ha\i‘  not  reconciled 
dctermimsni  with  immedi.icv  1  urihermoie.  these  iiioiiels,  like  their  [nedecesso! s,  adopt 
jnuelv  sMit.iclic  lesohilioii  stiategies,  le.ulim’  to  the  problems  discussed  above 

"'In  ''I'lKi.ll,  lliis  III. IN  iciiiiire  luiiifniniii'  ihc  li.uisilise  Uomik-  ot  llie  lionnn.uu  e  irl.ilimiv  1  <■!  /(I  \  I 
iiKMii  III. 11  i  I iiiiii(‘di,il,-l\  ilomm.i'cs  \  I  1  hesr  .lie  Itic  icl.ilions  ir,juiic\l  lo\  mi'.'i  pin.ilion  i  I  hc  uoinin.iiK  C 
;c!.i!u'!is  I  ,111  hr  l.iKrn  .n,  .c,', cm-. Mis  in  Ihr  |<i'-iiuMic  r;iK-iihis  Uicn  :hc  followoi-'  .I'.ioiii'.  will  Mitiur  to 
I  ,,  'ini'inc  iiniiicii '  w  ^  i  cl.ii  ons 

.  \  \  .1  h  \  \  ■:  oi  :  I  >!  \  >1  1  ■  /  lor  I '  l<  X  V). 

.  \  /  r.  >  \  X  I  ■  1  h  1  , '  ; 
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In  this  section  we  review  the  range  of  garden  pain  (GP)  effects  and  unproblematic  ambigu¬ 
ities  (UPA)  The  section  concludes  with  a  review  o(  previous  garden  patlt  theories.  Though 
GP/UPA  phenomena  are  so  closely  related  to  ambiguity  resolution  (§2,3)  that  they  miglit 
properly  be  considered  subphenomena,  I  treat  them  sepaiately  for  two  reasons.  Pir.st,  die 
richness  of  the  phenomena  deserves  focu.sed  aUontiun  of  its  own.  Second,  a  theory  ^.t 
ambiguity  resolution  is  not  automatically  a  theory  of  garden  path  effects.  In  other  word.s,  it 
is  possible  to  have  a  good  theory  of  ambiguity  '■esolution  without  ha  ring  a  good  theory  of 
GPA.,1PA  effects,  and  vice  versa.  This  should  become  clear  in  §2.4.4. 


2,4.1  Garden  path  phepomena  defined 

A  garden  path  effect  arises  when  a  reader  or  listener  misinterprets  a  k  c.al  ambiguity  in  some 
way  and  is  unable  to  recover  the  correct  'uternretat'or,  without  reprocessing.  The  result  is 
an  impression  that  the  sentence  is  ungrammatical  or  does  not  make  seniC.  There  are  two 
kinds  of  garden  path  effects,  as  shown  in  (42)  and  (43 »; 

(42)  The  cotton  clothing  is  made  of  grows  in  Mississippi,  (cf.  The  cotton  that 
clothir'g  is  tnade  of  prows  in  Mi.ssissipp-  ) 

(43 )  (a  ,1  War  Worries  Dog  Consumers  (cf.  Warworrits  ere  dogging  consume  i.%) ' 
tb)  The  old  man’s  glasses  were  fihed  with  sherry. 


Syntactic  garden  paths,  .such  as  (42),  arise  becaum  the  disambiguating  information  is  grain- 
iriatical  in  natute,  and  the  structure  assigned  to  the  ambiguous  maiertaJ  canuo!  be  gram 
matically  incorporaied  into  the  remainder  of  the  .sentence  Syntactic  GPs  give  iTse  to  an 
imoressiou  ol  ungrarnmaticality.  .Non.syn.taciic  garden paih.s ,  such  a,s  these  m  (43),  arise  be¬ 
cause  the  disambiguating  infcrmaiion  is  semantic  or  pragmatic  in  nature.  The  iu.crpretation 
and  siru'.  ture  assigned  so  the  ambiguous  material  can  be  grainimuically  incorporated  into 

'he  remainder  of  the  sentence,  bu.  not  without  creating  a  sci.nasitic  c-r  pragmatic  anoma.y . 

'■•itcii  lo  humorous  effect  I  here  arc  .several  kind  .  of  nonsyntactic  gaiclen  paths,  depenuing 

'  whether  the  initial  anibiguity  is, structural  (4  Ti)or  semantic  (43b)  I  .shall  have  li'tle  rncie 
iu  sa\  about  m:nN\'!)UH,tic  GPs  since  ilu'y  have  not  liecii  well  studied.  la;r  Ib.e  remuindei 
of  die  thesiS,  tim  term  gu.mV/i  path  refers  (<)  syntticuc  garden  path  cftecis  unless  otlicrwise 
itmed. 

!  he  dermuon  ot  <  iP  given  ahreve  is  not  uiusersally  adopted  iii  the  psyad.ioluiguisia.' 
iricianu'’,  though  it  is  iaulv  eor.iinou.  Another  frequmii  use  ot  flie  tcru!  is  t!i  uder  to  anv 
intvsijfabic  e!i:'cf  that  jeschs  to.m  suirjeeis  inaK’ag  a  wiong  clKiue  ai  a  local  les'nguiiy, 
scvaidles-  ot  wiuShei  d'e  luisiiiteipieiai.ou  give  use  fo  mipicssions  ol  un-n .njuu.aiieala'y, 
A  tide  diosc  e'lccts  aif  la-Oainiv  mlerestuigas  well,  I  r  oiitimie  ‘o  ii',!:  tfie  stoaigci  Uennition 
h'.a  au\c  li  einfituc-i/cw  .in  enijuncallv  ineasuiatr'e  .aid  dr, on  in  all’',  U!i!)Oi!:int  ihsiUii i lo  i 

’  i  'iSi,  .  ■ .  i.  1 1  J  :  iu  .V.  V!  h’f r t  ; .  a.tl;'  niikni ,v'.  II 
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The  role  ‘if  garden  path  effects  in  psj't  hoJinguisUcs 

GP  effecis  have  played  a  dual  role  in  psycholinguistics.  Most  commonly,  'hey  have  been 
used  as  a  diagnosiic  for  exploring  strategics  of  ambiguity  resolution,  becau.sc  a  GP  effect 
O'  strong  evidence  that  a  subject  chose  a  particular  path  at  an  ambigiiotis  point.  However 
the  GP  effects  themselves  have  seldom  been  systematically  explored.  The  result  has  been 
a  sabstaatiai  body  of  ambiguity  reso.!ution  theory  (§2.3)  wiil'  few  truly  adequate  accounts 
oi  GP  effects  per  se.  This  general  point  has  also  been  emphasized  by  Pritchett  ( 1992). 

Plvidence  for  garden  path  effects 

respite  the  fact  that  GP  effects  have  been  somewhat  neglected  in  the  field,  there  exists  a 
fair  amount  of  compelli.ng  evidence  for  a  range  of  GP  types.  There  are  two  .fea,sons  for  tliis: 
(he  perceived  ungrammaticalify  of  GPs  maJccs  them  easily  subjvXt  to  linguistic  intuition, 
and  the  raft  of  experiments  on  ambiguity  resolution  have  left  behind  a  gold  mine  of  data 
relevant  to  GPs. 

■^'ables  2.5,  2.6,  and  2.7  pre.sents  a  collection  of  GP  effects  cias>i.'’ed  by  syntactic  type. 
(The  syntactic  classification  is  meant  to  be  descriptive;  the  use  of  traduional  grammatic',:! 
relations  '"houlu  not  be  taken  as  a  theoretical  commitment  to  a  relational  or  functional 
gtamniai  .)  The  stronge.st  source  of  empirical  evidence  comes  fiom  rapid  grammaticality 
judgments,  where  subjects  indicate  whether  they  tltink  a  Sv.ntence  or  sentence  fragment 
is  gran  natical  within  a  few'  hundred  milliseconds  of  reading  it  Reading  times  provide. 
■idd.iticc;'i  evidence,  .ou'"  must  be  intefp.reted  with  care,  since  reading  times  alone  do  not 
alvva’-  Ji;;ungi!!sh  GP  from  :ion-GP  ca,ses.  No  GP  type  is  hsted  solely  on  the  basis  of 
reading  tines. 

V-'tien  e.^amples  of  diese  garden  path  types  are  pre,sentsd  in  the  text,  die  GP  number 
from  Tables  2. 5-2. 7  will  be  appended  to  the  example  monber  as  fo!>'o\vs: 

(44;  GP!)  The  businessman  sent  the  {tr.intings  yesterday  was  upset. 


2.4.2  UiiprobJernafic  a!inbtg:u;tj‘.;s 

If  the  story  ended  witn  GP  eifecls,  thvie  would  be  no  diincult  ihcore'ica!  problem:  any 
smj.le  jiatii  detciministic  pcr'.:er  would  make  the  right  piedictiofiS.  However,  flaicus  ( (930; 
p  iintctl  out  that  diere  ue  ambiguous  stmetures  which  do  uot  c.use  people  difficu!!.’/  no 
matii'r  winch  intcrnretalioa  of  the  amb^gnity  prtives  corre-..;!  Taolcs  2.8,  2.9  ma.l  2, 10  li.it 
a  rang?  ol'  such  imp: obicmalsc  ami>iguities.  bd’.h  examples  come  in  pairs  Tac.h  sentence 
sn  she  pan  it'qi'iic-.  'h?  local  ambiguiiy  so  be  mterpicted  ».'(  a  difter-  nt  amy,  t)'ii  botli 
i-?;Uc.K'';s  are  aciieptable.  Ail  tie  types  involve  i,tiucu;ra!  arr.biguity  of  muih':  kind,  wnh 
tlie  csccrtion  of  opAur  and  UIAtP  wliah  wen:  luciu’ied  lO  illustnge  ihc  ssisid'iciency  or 
:i?ican!:c  (diarL'cniari  v  itu  .\)1(>;  antbiguiiv  to  cause  OP  ei'lect-,, 


Giinicti  pMh  cdvcts  :ind  ur-problcniutic  ambiguities 


[able  2,5:  A  collection  ot  gorden  path  construct ion,s  (pLrt  1  of  3) 


Type  t'xAM'PLE 


C-P!  Direct  object/s'ib;„.ci 

(Frazser  &  Rrsyiv'r  !98.i,  Pritcl.eti, 
19>:8) 

GP2  Direct  objectysuhject  (long) 

(Warner  &  Glass.  1987) 


GP3  Complement  clause/subject  sentence. 
(Gib, son,  1991) 

GP4  Direct  object/siibject  with  embedded 
reia.'ive 

(W;  .met  &  Glass ,  1 987) 

<1P5  Edrect  obiect/  ubject  with  relative 
clause 

(Wan  er  &  Gl;.ts,s,  !  987) 

CiPCi  Preposition  ooject/st  Eject 

(Fruner,  )  978;  PritcErit,  1988) 

GP7  l,>irt;c!  object/subjec!  e  t  second 
(.  oinplenient 
(e'nichelt,  1988) 

GP:-:  Duvet  t)[>|ec,/;>ub!(;\-i  of  s  "cond 

I'venplesnent 

v  'Oioii.o  (iiH!  contpleti  en  i 
sF^Kviieti  !98Hi 


.Since  Ja,  always  jog;4  a  mile  seems  like  a  sh(,.;t 
iistaitce  lo  tion. 

vvE  Since  Jay  alv’ays  jogs,  a  n'ile  secsts  like  a 
.Si'.orl  distance  to  him.) 

The  girls  oelieve  the  man  'vho  believes  the  ■  ery 
strong  ugly  boys  struck  the  clog  killed  the  cats, 
(cf.  The  gids  believe  that  the  man  who  belidve.s 
that  the  very  strong  ugly  boys  struck  the  dog  killed 
the  cats. ) 

1  believe  that  John  sniokes  aimoys  .Vlacy 

(cf.  I  believe  that  John 's  smoking  annoys  Mary. ) 

Before  the  boy  kills  the  man  the  dog  bites  sirikts, 
(cf.  Before  the  boy  kills,  >h"  m,tn  rhotth'^  dog  bites 
strike.’'.) 

When  the  horse,  kseks  the  boy  the  dog  bites  tiie 
nan. 

(cf  Wne.>t  horse  locks  the  hoy,  ih'-  'log  bites 
the  mail.) 

W-.tho.U  her  coutrioations  failed  to  v,on(e  in. 

(cf.  Wthout  her,  contributions  Jaded  to  tome  m  : 

1  ti'pvinced  iier  professors  hate  rne. 

( .T.  ./  convinced  ho  that  arefestors  hcie  rne.) 

The  dvicior  warned  the  patient  vvotild  be 

COViia^i  >ilS. 

(ca.  H'ar/ieJ  iii-a  .'tc  fji-; 

r*riiUdioii t 
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jlumur,  !u‘:-<tei'>cc  Cc mpiciKrsion:  llicnofnenn  and  Freviou.;  Woi'k 


Table  2.6:  A  (.■.ollec'.ion  of  g'lr.ten  path  con.st'.uctions  (part  2  o:f  3). 


Example 


Type 


Indirect  object.' sabj-ict  of  rclacivc* 
(Wi.aine,'- et  al..  1975;  Pritchett,  1992) 

GPttJ  Erribeddi.d  oi'ject/uiarr'x  object 
(l^itchet;  19'^?.) 

CP'M  I  Cotnpleinen*  c!:m;  e/relative  cLace 
(Cram  ct  Stect'man  1985) 


G''’12  PP  ergument/adjunct 

(GibsK-n,  199.(:  Pnxchett,  1992) 

GP !  3  Relative  clause/cornpleiaent  clause 
(Crain  2  Sleedrnan,  3  9K5  ; 


GPb’  .Main  ^'t•rb/Icdtced  relative 
(Bever,  1970'' 

GF.‘5  Main  rerlV'.'edui  ed 'eiatjve  (short) 
(FCi'itzman,  1983;  /.bnty,  '989.i 

Ditraji.siti  ve  niaiii  vevb/rcduced 
relative 

(Raynci  ct  ai.,  1 983) 

Alain  \';’rb/en. bedded  relative 
(CMb'.oa,  d)yi) 


John  gave  the  boy  the  dog  bit  a  dollar. 

(cf.  John  gave  the  h-'A;  that  the  dog  nil  a  aollac) 

Sue  gave  the  man  who  was  raciii,g  the  ear. 

(cf.  ^he  gftvf  the  car  to  the  maa  who  wac  racing.) 

The  psyfiioLyyisi  told  the  \yife  that  he  'va.s 
iraving  trouble  with  to  leave. 

(cf.  The  piychologis.  told  the  wife  who(m )  he  wcu 
having  trouble  with,  to  leave.) 

I  .sent  'he  letters  lo  Ron  to  Rex. 

(cf.  I  sent  the  lettei.t  frooi  Ron  to  Rex.) 

'Ihe  psychologist te'd  the  wife  that  he  wtts  having 
trouble  ndth  her  husband,  (cf  Jne  psychologist 
told  the  wife  theu  he  vrc.''  having  trouble  with  to 
leave.) 

The  horse  raced  past  the  bacn  fell. 

(cf.  The  car  driven  past  .‘he  barn  .stalled.) 

'(he  boat  floated  sank. 

(c:'.  Ihe  car  driven  stalled.) 

The.  woiiian  rroughi  the  tlowers  smiled  broadly, 
tef  The  ivomcn  given  'he  flowers  s/niiea  hroadly.) 

Ihe  d'.g  that  vv.is  led  next  to  tiie  cat  walked  to 
the  park  rhewed  the 

id',  the  dog  that  rvc'.v  fed  nest  to  the  cat  seen  tv 
the  f.-ov  chewed  die  hone.) 


GFIT 
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Table  2.7:  A  collection  of  garden  path  constructions  (ptnt  3  of  3). 


Example 


"Type 


GPIS  Adjectiye/poan  followed  by 
noun/ verb 
(Milne,  1982) 

V.-P1 9  N'oUhj''vcrb 

(M'lne.  1932) 

GP20  Adjective  nouii/fioun-reiativc 
(Marcus,  ’  ‘980} 


(,,i ?2  i  Subject/'verh  (deri  vcd  nominal) 
(Fritcneti,  >992,MiLw.  1982) 

GP22  Predicate  conip!  'ment/rsubject 
(Ford  et  al.,  1982) 

C3P23  Thai  'ornpleraoiUizer/pronoun 

t;r'24  I  hat  compicnuintiiicr/dcterPLine,;' 
(Gibson,  1991) 

GF/f  Thai  compleipentizer  for  subject 
sepserK'.e/detcr  miner 
(Gibsoic  199!) 

Gi’26  Main  v\nl'/:'m.\ilu>ry 

( Kui'tonan,  19£.'.';  Maxiis,  I9SQ; 


The  brilding  blocks  the  -mn  faded  aie  red. 

(cf  The  h'o'-hs  th.at  >hc  '■:un  faded  are  red.) 

The  granite  'oc'k.s  by  the  seashore  with  die  waves, 
(cf.  The  grande  gently  rocks  by  the  seashore  '^'hk 
the  wo  ve.s. ) 

The  ccno.r  riothiiig  is  made  of  grow's  in 
M'ississipjT.. 

i  cf.  'The  cotton  that  clothing  i.s  made  of  grows  in 
Mississippi) 

The  old  train  the  young. 

(cT  The  older  folks  train  the  younger  folks.) 

The  boy  got  fat  melted. 

(of.  The  hoy  .^ot  butter  melted.) 

Refore  she  krK=iw  that  she  went  to  th>^  store. 

(cf.  Before  .she  knew  that,  she  went  to  the  store.) 

I  saw  th,a';  white  moose  are  ugly. 

(cf.  J  ,vaH’  that  cats  are  ugly.) 

That  coffee  ta.ste.s  teirible  surprised  John. 

(cf.  ft  .surnri<^(.-tj  Joh.%  that  txffee  tastes  tenibli.  l 

Have  the  boys  gr/en  gifts  by  their  friends. 

(cf.  T/ave  the  .boy's  friends  give  gifts  to  them.) 


i: 

I 

i'i' 

'J*V 

” 

Chiipicr .'’.  llumut!  Sentence  C( 

'inprehension:  Phcnuiucnn  and  Previous  Work 

'N 

[,• 

i 

I'AfiLK  2.8,  A  coUection  of  unproblematic  anibif,Tiities  ^part  1  of  3). 

'X''C 

'il' 

Type 

Example 

iv 

UPAl 

D I  rec t  ouj  ec  t/su  bjee  1 

I  knew  the  man. 

« . 

(KinfoaU,  !973;  Ferreira  &  Header- 

I  knew  the  man  hated  me  passionately. 

son, 1990) 

Uf'A?- 

{3ircct  object  o!  fronted 

When  the  boys  strike  the  dog  kills. 

Ai^ 

tJause/siibject  (short) 

(Warricr  &  Glass,  1987;t 

When  the  boys  strike  the  dog  the  cat  inns  away. 

V.f: 

UPA< 

Direct  cibjecMii'.bject  (long) 

Ron  believed  the  ugly  little  linguistics  professor. 

(Pritcheu,  1992) 

Ron  believed  the  ugly  little  linguistics  professor 

'V^"t 

v^-i 

he  had  met  the  week  before  in  Prague  disliked 

him. 

IIPA4 

NP/NP  .specifier  (Pritchett,  1988) 

Without  he'"  we  failed. 

W'ithou?  her  contributions  we  failed. 

i 

UFA5 

Plural  .MP^NP  specifier 

The  w  oman  kicked  her  sons. 

■'.'Ji 

v.\^ 

(Fatchett,  1988) 

The  woman  kicked  her  sons  dogs  houses  doors. 

I 

1 

•  *•■ 

UF'A'i 

Second  objcct/spscifier 

The  cop  gave  her  eanings 

(Gib'-'ot.,  1991) 

The  cop  gave  her  em'rings  to  the  dog. 

UPA7 

PP  argurrtent/argumenl 

The  minister  warned  the  president  of  the  danger. 

:'3 

(Gibson,  199!) 

The  minister  warned  the  president  of  the  republic 

V- 

'4'.' 

of  the  danger. 

’■■A' 

UI'*/-,  8 

Pretiicatc  corripieniens/NP-modifier 

Is  the  block  in  the  box’’ 

1 

(M.arci!s,  1980) 

D  the  block  in  the  box  red 

1 

UFA9 

Conipiesnentysubiect  relatjve 

John  told  the  man  that  Mary  kissed  Bill. 

1 

{(Sibson,  1991) 

John  told  the  man  that  kissed  Mary  ’hat  Bill  saw 

- 

'1 

Phil. 

;’s 

MPA  10 

NF  coiriplenien'/relative  clause 

The  repor'  !hal  the  president  sent  to  us  helped  us 

S; 

(Gih.son  199!) 

make  the  decision. 

The  report  that  the  president  sent  tSic  troops  into 

i 

•> 

^ ' 

. . . 

.  . . 

combat  depressed  me. 

»'.! 

i'* 

i 

% 

<i 
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FabUl  2.9:  A  collection  of  unproblematic  ambiguities  (part  2  of  3). 


Type 


Example 


UPA I !  Predicate  complement/adjective 


UPAI2  Main  verb/rsduced  relative, 
obligatory  object 

(Pritchett,  1988;  Ferreira  &  Clifton, 
1986;  Just  &  Carpenter,  1992) 

UPA  13  Reduced  relative/main  verb 

(Gibson,  1991;  Pritchett,  1992) 


UPA  1 4  Modified  main  verb/reduced  relative 


UPA  15  Compound  noun  followed  by 
noun/verb 

(Frazier  &  Rayner,  1 987) 

UPA  16  Noun/auxiliary  verb 
(Gibson,  1991 ) 

UPA  17  Adjective/noun  followed  by 
nouii/verb 
(Milne,  1982) 

UPA  18  Noun/adjective 


UPA  1 9  Co.iiplcment/adjccti  ve 
(Pritchett,  1988) 

Derived  Moininal 
(Milne,  1982;  Pritcheti,  1992) 


The  boy  got  fat. 

The  boy  got  fat  mice  for  his  pet  snake. 

The  defendant  examined  the  evidence. 

The  defendant  examined  by  the  lawyer  shocked 
the  jury. 

The  bird  found  in  the  room  died. 

The  bird  found  in  the  room  enough  debris  to  build 
a  nest. 

The  defendant  carefully  examined  the  evidence. 
The  defendant  carefully  examined  by  the  prose¬ 
cutor  looked  nervous. 

The  warehouse  fires  numeroirs  employees  each 
yeta. 

The  warehouse  fires  kill  numerous  employees 
each  year. 

The  paint  can  fell  down  the  stairs. 

The  paint  can  be  applied  easily  with  a  new  brush. 

The  building  blocks  are  red. 

The  building  blocks  the  sun. 

The  squam  is  red. 

The  squaie  table  is  red. 

I  like  green. 

I  like  green  dragons. 

Tlie  old  teai  h  very  well 
The  old  train  is  big 


UPAZO 
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Cluiptcr  2.  llu'.uitn  Sentence  Comprehension:  Phenoincnn  ami  Previoir  Work 


Table  2.10:  A  collection  olTinproblematic  ambiguities  (part  3  of  3). 


fYl’E 


Example 


UPA2 1 

That  pronoun/determiner 

I  know  that. 

I  know  that  hoy. 

UPA22 

That  pronoim/complenientizer 

1  know  that. 

I  know  that  dogs  should  play. 

UPA23 

Singular  noun/plural  noun 
(Kurtzman,  1985) 

The  sheep  seem  very  happy. 

The  sheep  seems  veiy  happy. 

UPA24 

To  inflection  rnarker/preposition 
(Gibson,  1991) 

I  opened  the  letter  to  Mary. 

I  opened  the  letter  to  impress  Mai^. 

UPA25 

Object  gap/preposition  object  gap 

John  saw  the  ball  the  boy  hit. 

John  saw  the  ball  the  boy  hit  the  window  with. 

UPA26 

Long  distance  object  gap 

Who  do  you  believe? 

Who  do  you  believe  John  suspects  Steve  knows 
Bill  hates? 

UPA27 

NP/small  clause  VP 
(Pritchett,  1992) 

I  saw  her  duck  fly  away. 

I  saw  her  duck  into  an  alleyway. 

UPA28 

Coordination 

1.  went  to  the  mall. 

I  went  to  the  mall  and  the  bookstore. 

UPA29 

Multiple  compounding 
(Pritchett,  1992) 

We  admire  their  intelligence. 

We  admire  their  intelligenc  agency  policy 
decisions. 

UPA30 

Semantic  role  switch 
(Pritchett,  1992;  Tanenhaus  &  Carl¬ 
son,  1989) 

I  gave  the  dogs  to  Mary. 

1  gave  the  dogs  some  bones. 

upa:3  I 

Verb/verb  plus  particle 

John  picked  the  boy  for  his  team 

John  picked  the  boy  up  yesterday. 

‘13 


2.4.  (  iiiti.lcii  puih  effects  ;iinl  iinprohleniutie  ajiitfipuilies 


2.4.3  Some  general  garden  path  plicnomena 

In  addition  to  siniply  compiling  Ihc  known  garden  path  constructions,  \vc  can  abstract  a  few 
general  facts  about  GP/UF^A  phenomena,  nicest  of  wtiicfi  are  afiparent  f  rom  examination  of 
Tables  2.5  2.10, 

1 .  Recovenihilitv.  F\'op!c  can  eventually  reet- ver  from  garden  patlis  througlt  delifaenilion 
or  explicit  instruction.  Once  the  “{tuzzie”  is  solved,  the  sentence  may  be  perceived 
us  grammatical.  Fi^ecovcrability  is  titeoreticaily  significant  in  itself  and  also  helps  to 
distinguish  GPs  from  other  kind  of  processing  difficulties,  such  as  those  discus.sed  in 
§2.5. 

2.  Bidirectionality.  GP  effects  can  be  bidirectional  in  the  sense  that  they  are  independent 
of  any  preferred  direction  of  the  resolution  of  an  ambiguity.  In  other  words,  GP  effects 
can  arise  even  when  the  unpreferred  path  is  taken  at  a  local  ambiguity  (say,  the  relati  ve 
clause  reading  over  the  main  verb  reading)  and  the  normally  preferred  interpretation 
turns  out  to  be  correct.  Examples  include  GP4  and  GP13.  This  clearly  demonstrates 
the  independence  of  the  GP  effect  from  the  phenoinena  of  ambiguity  resolution  per 
se. 

3.  Independence  of  length.  Length  is  not  necessarily  a  determining  factor  in  GP  effects 
(Pritchett,  1992).  More  precisely,  the  di,stance  from  the  ambiguous  point  to  the 
disambiguating  region  may  be  very  short,  or  even  zero,  and  still  give  rise  to  a  GP 
eftect  (GPl,  GP6-10,  GP!5,  GP22);  and  the  distance  to  the  disambrguabng  region  may 
be  extended  without  necessarily  giving  rise  to  a  GP  effect  (i;pa3,UPA8). 

4.  Distance-to-disambiguation  effects.  Although  length  is  not  always  a  factor,  the 
material  intervening  bet  ween  the  initial  ambiguity  and  the  disambiguating  point  can 
have  an  effect  on  both  the  immediate  pcceplion  of  grammatical  ity  (GP:;;  (  Warner  & 
Glass,  1987)),  and  the  process  of  deliberate  recovery  (Ferreira  &  Henderson.  1991) 
Generally,  these  studies  found  that  tFie  more  intervening  .material,  the  more  likely  a 
GP  effect  arises  or  the  more  difficult  it  is  to  recover  from.  Warnei  &  Glass  (1987) 
claim  that  this  is  essentially  a  length  effect,  but  length  alone  (as  measured  in  som>’ 
surface  metr'c  .  uch  as  words  oi  syllables)  cannot  be  the  sole  factor  as  deniCKslratcd 
above.  Therefore,  “distancc-lo-disaiabigu  ition  ’  refers  to  the  weaker  c'airn  tliat  the 
intervening  material  can  have  an  effect;  thru  much  is  supported  by  the  data 

5.  Indepctidence  <>,  le.xictil  umhiguity.  lexical  ambiguity  is  neither  necessary  nor  sufli- 
cient  for  GF-’ effects  toarisc  Pritchett  ( 1992;.  This  is  apparent  froin  UP!  17,  UPAlS  7.  I, 
and  l.if’A.?/- 

fc  lndei>cn<lcne‘'  oj  setnantic  ecnitcnt  .Sem.antic  ambigiiily  need  not  eat.i.a,'  a  Gl*  cffc'Cl. 
in'.A  to  exhibits  local  amliiguity  in  the  assignment  of : lie  malic  roles,  but  bod!  senleiK  es 
are  easiK'  priKcssed  (  ianciiitaiis  A  Gatison.  I98U'  I'nieiicii,  199,(): 

(■'n  .  (  iPAtO)  (a)  We  lo.Kied  the  tiuck,,YAi/  vviif!  banana' 

lb)  Wa!  foauerl  tfrr'  to.idc/;// ^,f/  onto  ifn,'  '''iig';',, ja/ 
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ilKini.crJ.  lluiit.'iii  Sentence  Cocnpreltension:  Phenoincna  mid  Previous  Work 


2.4.4  (iarden  path  theories 

‘  (iarden  Path’'  mmlels 

G  ircleo  Path*’’  models  (Clark  &  Clark,  1977)  are  sirnply  single-path  models- — t):ey  main¬ 
tain  a  single  interpretation  during  comprehension.  effects  thus  arise  whenever  the 
comprehender  selects  the  wrong  interpretation  at  an  ambiguity. 

The  LJPA  data  .should  make  unabundantly  clear  that  any  purely  s.ingle-path  theory  is 
doomed  to  overpredict  GP  effects,  no  rnatter  how  accurately  that  theory  may  predict  the 
direction  of  ambiguity  resolution.  Neverthe!e.ss,  the  Garden  Path  model  is  the  default 
assurtipiion  in  many  theories,  such  as  the  early  strategy-based  models’  \  the  Saus.'ige  Ma¬ 
chine/Minimal  Attachment  mode',  and  the  semantically-driven  model  of  .Milne  (1982). 

'lo  make  the  point  concrete,  consider  the  failure  of  Minimal  Attachment  on  (4b)  below. 

(4h;  Ui'A  ')  (a)  5ieth  believed  the  diiectcr. 

(0)  Seth  believed  the  direc'o;  v  as  lying. 

Minima!  Attachment  predicts  that  the  KV  dr>ector  will  be  attached  in  complement  position, 
as  in  (46a).  'Fherefore,  when  this  proves  to  be  incorrect  in  (46b),  a  gai  dep  path  effect  should 
arise;  this  is  clearly  rot  the  case,  .Similar  criticisms  hold  of  the  Altmann/Crain/Steedrnan 
model,  strictly  in  erpreted  ?s  a  Garden  Path  model  (Gibson,  1991). 

Deterministic  parsers 

The  PARSIFAL  modei  (Marcus,  1980)  fan-.s  somewhat  better  than  the  Garden  Path  theorie,. 
described  above,  since  the  basic  lookahead  architecture  io  fundamenially  responsive  to 
the  need  to  handle  unproblematic  ambiguities.  The  lookahead  buifer  in  F'lH.siFAL  holdi 
constituents’'’  rather  than  words  (as  in  the  sin-word  window  of  the  Sausage  Machine). 
Garden  path  effect, s  arise  when  the  disambiguating  syntactic  materia)  falls  outside  ’b-i  ’  hre 
cell  window,  and  the  parser  operates  short  sightedly.  For  example,  coiisidei  prcce.-’.sing  r>n 
(47a)  below: 


(47;  GP!4)  (a)  The  boat  floated  down  the  river  sank, 

(b)  The  boat  [Ooatedlv  |dow'i|/.  !the  river|jv/'. 

After  processing  the  NP  the  boat  i  w  hich  is  then  pushed  onio  the  s4ic).),  the  contents  of  the 
lookahead  ceils  are  .as  shown  in  (47b),  The  disambiguating  final  vero  sank  is  out  o'  sight  as 

'*Iusi  to  be  deal:  <  Uirden  I’alli  wil!  be  used  to  retei  lo  this  parlH.i.ilar  class  o(  llieorie.s,  vvliile  sunhn  path 
iders  1(1  the  [liu'iioinenon  GP  is  an  abbievialioii  lA  ytmh-’i  ruifr  Ihus,  (iaideii  Paih  iiioild.s  are  a  das-,  of 
eardeii  path  {( iP)  dieory  Mie  Gardt/ri  Path  theory  is  sameliines  used  to  refer  speciiically  to  iaa/iet  s  Mniimal 
,-\iiaclini(.-ni  ilicoiy  (I  ia/u*f  I ‘1X7),  bni  not  iii  ih,s  thesis. 

''  ihoiipti  Kiniball  .'I'i/t)  leah/t'd  the  nrebiein  and  assuinc-.i  hum  in  i.:ouiji[ t-tiensui.'i  empioyed  some 
loof.afie  1 1 

'  ‘  Ihe  pici  ,'S,-  s[)ei  ifn  iliop  "i  whn  fi  consiituents  on  npied  cells  in  ttic  iniiters  iV.is  .i()|'.iii‘nth  .i  dcL'ics  i,'! 
(rei'doin  m  die  mode:  tm  liUiny  t  i’Pd..iIa.  see  (Pnic.hett.  IdP.’)  tor  iliscnssiori 


2.4.  (  iiinicu  pMh  ct'fccis  iirid  unpiobkmcitic.  iiinhiguitlcs 
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the  processor  structures  tfie  initial  material.  The  main  verb  reading  is  chosen,  and  ttie  (iP 
etfect  subsequently  ensues  when  sank  cannot  be  incorporated. 

The  primary  empirical  problem  with  PARSIFal’s  lookahead  device  (apart  from  the 
inconsistency  with  immediacy  discussed  in  §2.3.5)  is  that  it  predicts  length  eflects  where 
none  l.kisl  Consider  Just  a  slight  variant  on  the  GP  structure  in  (47a): 

(48;  CtPi4)  The  boat  floated  quickly  sank. 

Since  the  remainder  o>  the  sentence  after  the  boat  can  fit  into  the  three-cell  buffer,  PARSIFAL, 
incorrectly  predicts  that  this  sentence  is  not  a  garden  path. 

The  later  D-theory  model  developed  by  Marcus  et  al,  (1983)  traded  the  power  of  the 
lookahead  buffer  to  resolve  ambiguities  for  the  power  of  dom.inance  relations  to  minimally 
commit  to  syntactic  structure.  The  D-theory  model  is  an  advance  insofar  as  it  eliminates 
the  oversensitivity  to  length.  Unfortunately,  the  move  to  D-theory  also  trades  one  set  of 
empirical  problems  for  another.  In  particular,  D-theory  encounters  difficulty  with  GP7  and 
GPS;  see  tPritchett,  1992)  for  details. 

The  minimal  commitment  models  of  Weinberg  (1993)  and  Gorrell  (1993)  modify  var¬ 
ious  aspects  of  the  original  D-theory  model,  improving  upon  its  predictions.  As  noted  in 
§2.3.5,  however,  two  problems  remain  for  all  the  thco^'les  developed  thus  far  within  the 
deterministic  framework:  accounting  for  immediacy  of  interpretation,  and  accounting  for 
the  inte:ractive  ambiguity  resolution  effects. 


Garden  Path  models  with  constrained  reanalysls 

lo  adequately  account  for  the  Ui'A  data,  a  single-path  model  must  be  .-vuginented  with 
some  kinci  of  reanalysis  mechanism.  The  reanalysis  must  oe  constrained  in  some  fashion, 
otherwise  the  GP  prcdit:lions  woidd  be  lost.  Frasier  &  Rayner  ( 1982)  propo.sed  a  kind  of 
re:inalysi.s  to  augment  the  .Minimal  Attachment  model.  The  motivation  was,  precisely  the 
kind  ef  u!i[)ro'plematic  structure  e.\hib!t(.,d  in  (46).  The  reanalysis  strategy  (dubbed  Steal  NP 
by  Pritchett  ( 1992),  cficr  (Abney,  I '*86;)  is  triggered  by  an  incoming  subjeetles.s  veib,  iur 
exainpie,  iw/.v  in  (46i:>).  A  pi»*vious!y  aiiaiy.sed  NF’  [the  direetor)  cue  then  be  ,aiachcd  a.s  i!ic 
subject  of  the  incon  .mg  verb,  p'-ovided  the  NI’  is  close  enough  in  the  surlace  string. 

Ste.  J  -NP  was  an  advance  ovc  r  odicrCiarden  Path  models  bct:au:,e  it  was  the  lirsl  c.Kplicit 
atempl  to  formulate  a  ic  inalysis  Urategy.  However,  there  are  a  number  ol  sluatc  oimngs. 
Apart  1  lom  tite  ('loblems  of  v;igiii  iormiilatiou,,  the  sti  :,ttegy  tails  to  actually  make  the  correct 
I'lrdu  ,ior.',  legttrdutg  ( 11^  tnid  flf'A  l  oiilrtisls  tPnlr  licit  (  i'*9.f )  firesmits  a  d(.‘tai!ed  ea  ilu|ii;‘). 
I'oi  ex.urijile,  while  the  ^  ase  i.u'  proeC'-siiig  (4f')  is  ari ouiiU.al  toi,  the  (d'  p'Kaht  tioiiH  hn  ihe 
lollow  ng.  Lietm ('.s  an.'  now  niisseii. 

f-lO;  I  |U  1  Wai.ic  Marv  sewa'd  a  sock  fell  on  the  Ih  ai 

(:m),  (it'.o  Sbui  '  i:  w.nned  the  podessoi  vondd  in’  aiigiN. 
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Blank  (!989)  proposes  a  much  more  computationally  explicit  reanalysis  mechanism 
called  boundary  backtracking.  Blank’s  parsing  architecture  is  based  on  a  grammatical  for¬ 
malism  called  register  vector  grammar.  The  architecture  is  basi.:ally  a  finite  state  machine, 
where  state  symbols  are  replaced  with  vectors  of  three-valued  syntactic  features  (the  three 
values  are  +,  and  don’t  care).  Parsing  is  accomplished  by  making  transitions  through  the 
state  space  with  productions  that  match  against  the  state  vector  and  make  changes  to  the 
vector.  There  exists  a  fixed  number  of  boundary  registers  that  save  the  state  of  the  machine 
at  particular  phrase  boundaries  types.  ]f  the  parse  fails,  the  machine  returns  to  a  state  in 
one  of  the  boundary  registers  to  try  a  different  path.  If  the  required  state  is  not  available, 
the  machine  fails. 

As  an  example  of  hov/  Blank’s  machine  predicts  a  garden  path,  consider  processing  on 
the  main  verb/reduced  relative  GP: 

(5  ] ;  GP14)  The  horse  I  raced  past  I  the  barn  I  feii. 

Relevant  boundaries  are  marked  with  a  I  in  the  sentence.  Blank  posits  a  boundary  register 
that  saves  state  at  phrasal  boundaries.  As  the  processor  encounters  raced,  an  explicit 
production  fires  triggered  by  the  closing  of  the  noun  phrase  horse.  This  production  save,s 
the  current  state  in  the  phrasal  boundary  register.  This  is  the  state  ju.st  before  the  noun 
phrase  is  closed,  and  includes  the  option  of  taking  raced  as  a  reduced  relative  modifier. 
After  the  processor  accepts  the  preposition  past,  it  again  saves  state  in  the  phrasal  boundary 
register,  overwriting  the  old  state.  Thus,  when  the  disambiguating  final  verb  occurs,  and 
the  processor  attempts  to  backtrack,  the  required  state  is  no  longer  available. 

One  theoretical  problem,  which  Blank  himself  points  out,  is  that  the  type  and  number  of 
the  boundary  registers  is  unconstrained — Blank  .simply  chose  a  set  that  seemed  reasonable. 
There  are  a  number  of  empirical  problems  as  well,  since  the  model  is  oversensitive  to 
length  effects  (e.g.,  GPl.s).  However,  it  reniams  an  importam  comrihution  since  it  is  the  first 
computational  system  not  ba.scd  or,  lookahead  thal  makes  explicit  GF’/UPA  predictions 


On-line  Locality  Cmistraint 

As  pari  of  his  piograrn  oi developing  .i  strongly  grammalicaily  clci  i  ved  processing  niotici, 
I'riKhctl  (  i'tSH.  IV)02)  pn>j-0',cil  a  t.oMsiianit  tint  chaiactci i/es  p.ieeiM'ly  wlieii  gaideu  fiath 
cfleets  arise,  as  a  tunctioii  ot  the  struefure  of  the  (>relerreii  iiiterpretaoon  aiui  ilie  siruclure 
of  liu-  rec|ui(ed  igluhailv  corieet)  interpretation: 

(  s.’  I  Thr  c  hi  I  in.  -  I.ixaditx  ( 'orisl  taint  f  t  >/,/.(')  I  he  I.ngel  posilion  ( i!  an\'  imusl  [>e 
i’jr,  r/iit'd  or  diniuiniiCil  by  llie  soiiia  e  ['osiiion  ( it  ane  ;  otliei  vv  ise  all.u  iiiiuoi! 
!s  impo'sil'le  !'\  ihi- aiilonial K  lluinan  .Siailenee  I'loi^i-sso:. 

W  e  need  livU  be  eoiu  ei  neil  line  lili  die  pieeise  deiinUions  ( i[  ;m  a  eouneail  and  di  iinin.mee 
Old e  iti.il  ilie’v  au’  ;,n  .irnii i.u rail  relai lonsliip:--  ;:enli  al  to  ( , It  s',  ii!a s .  and  that  1  i ie\  ,iu‘  ['ui e  ! \ 
siiiKlmai  I  el.i!  ii  >ns|ii|  is  iji.' inied  bei  ween  node:-,  ol  pin  .i\e  sO  iKa'uie  n  i,  e  . 

I  ( 1  see  In  'w  i  In  (  U  !  i  '  j  i  ed r  I s  (  4 ’  ed tevUs,  ,:oiisidei  iir-  e.ii  den  j >,ii.n  in  ( d 


2.4.  Garden  path  effects  and  unproblematic  ambiguities 
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Figure  2.3;  How  the  G  I-Iine  Locality  Constraint  (OLl.C)  predicts  a  garden  path.  The  NP  the 
water  is  initially  attached ;  s  the  complement  of  the  VP  drank  (source  position).  The  globally  coirect 
position  (target)  is  not  gc  venied  or  dominated  by  the  source  position,  violating  the  OLL,C. 

(53;  GPi)  After  Susan  drank  the  water  evaporated. 

The  globally  correct  stru  dure  of  (53)  is  given  in  Figure  2.3.  Assume  that  the  water  is 
initially  attached  a.s  the  object  of  drank  (as  is  required  by  Generalized  Theta  Attachment 
(36)).  The  source  and  tar*  ,et  positions  of  the  NP  the  water  are  annotated  on  the  tree  structure. 
The  relevant  fiu.t  is  that  the  water  must  be  rcanalysed  from  the  object  of  drank  (source) 
to  the  subject  cf  e^apot  ited  (target).  (In  GB  terms,  from  the  (ornplcment  of  a  VP’  to  the 
specifier  of  a ;  IP)  Ho  .'eve:,  as  should  be  apparent  from  Figure  2.3,  tiie  source  position 
neither  domi a  itcs  .loi  governs  the  target  position,  in  violation  of  OLLC 

The  Ol  I .( '  (ai  d  the  original  Theta  Reanaiysis  Constr  iin!  that  it  leplii'.'es)  is  an  important 
breakthrough  in  fJP  theory  for  three  reasons,  l  irst,  d  piovide:  a  precise  and  widely 
af>olicabi(’  reanaiysis  constraiul:  the  OLLC  can  be  apnlied  to  anv  s,,ucturc  in  question, 
provided  a  CB  analysis  can  he  [iiovided.  .Second  at  the  time,  the  t  icory  provided  by  far 
the  widest  e  npincal  eo'  erage  with  respect  to  llie  raiige  of  garden  paths  and  unpK'bleinalis 
ainbiguili  s  T!i  nf  the  wc/ik  established  siipporl  loi  the  nioie  general  claim  that  (>!’  elteets 
air  [iineb.  a  lenetionot  syntailie  stiiietiire. 

riu  i. liter  poinl  is  so  iinporlant  thal  I  piesenl  i!  as  a  sejiaiate  hypolliesis  helow: 

iS-t)  I  li(’  hit  Hi  im  id  ( iiiiiliH  Ptilh  I  l\'j>i>tih'M.\  tiaialt.'ii  palh  elli’cis  are  piiwK  a 
limelion  ol  ildlerenees  hetweiui  the  sviitaetie  stiuetuie  ot  ihe  pielerreti  inlet 
pi  el  at  ion,  and  the  s\  a  il.iel  k'  sI  i  tie  lure  o!  I  he  g  lohal  i  \  ern  i  ei.!  i  nie  i  p:  el  ,t!  .on 

fills  IS  a  moR-  geiiei  al  elaiin  ili.ni  ih-c  ( )1 .1  the  ( >1  1  t '  is  om;  posMlde  u  .in  ol  del  in  mg  I  tie 
lel:"',  .ml  ild I eii'iu.. e  I'l'iween  siinelines  i  0-1 1  does  n<U  si'ci'ile  \s  lull  ihe  pielei  letl  inlei  piel.i 
lion  IS  III  (sii  ik  uLn  It  is  .11(0  sa\  ing  that  lire  pieleni  d  inlequel.ilioii  is  puieh  .i  diikiioii 
I  I  ,,ni.u;|u.  S  i  iieime,  I  hat  ;s  .ai  .iltv-ei  fai  i  stiongm  .iiut  iiu1e(H'iidenl  goMiion,  one  ili.it 
I  'i  iief.i  -  in  I  -iiO  .idopl  i  in  the  loi  m  ol  ( leiuaali.o.al  1  het.i  .\ii.u  hnieiit  i  hi  e 
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Frequency/strength  b  ised  accounts 

A  widely-adopted  assumption  is  that  GP  effects  are  a  function  of  the  strength  or  frequency 
of  particular  lexical  or  syntactic  forms.  Under  this  v'ew,  GP  effects  arise  when  a  stronger 
or  more  frequent  lexical/syntactic  form  must  be  abandoned  in  favor  of  a  form  that  is  much 
weaker  or  less  frequent.  Ford  et  al.  (1982)  formulated  a  lexical  version  of  this  hypothesis 
which  attributed  GP  effects  to  morphosyntactic  reanalysis  of  lexical  iLems.  Consider  their 
example: 

(55;  GP22)  The  boy  got  fat  melted. 

According  to  the  lexical  theory,  the  GP  effect  in  (55)  arises  because  the  strong  lexical  form 
for  fat  is  adjectival,  and  this  analysis  must  be  dropped  in  favor  of  a  nominal  analysis.  A 
similar  explanation  holds  of  the  familiar  main  verb/reduced  relative  ambiguities  GPU;  the 
active  form  of  the  verb  is  much  stronger  than  the  passive  participle.  The  theory  runs  into 
empirical  difficulty  by  overpredicting  GP  effects.  For  example,  neither  .sentence  in  (56) 
causes  difficulty; 


(56:  URA.i  I)  (a)  Mike  likes  fai. 

(b)  Mike  likes  f.it  .steaks. 

In  general,  a  purely  lexical  GP  theory  cannot  account  li)r  the  (act  that  lexical  ambiguity  is 
neither  sufticient  nor  necessary  for  garden  path  effects  to  ;irisc  (  A'  4  V; 

A  similar  explanation  for  garden  pati  effects  is  often  g;ver  v-ith  .c.-pet  1  lo  syntaclic 
structures  rather  than  lexical  forms.  For  .  \...imple,  lliet  i;  RPADPR  ''.■odd  (.lust  Sc  C'arpenter, 
1 992)  and  iurafsky  ( 1 992)  model  assume  ;h.e  main  verh/r ,  Juced  relaove  ga;  den  path  derives 
Iron)  the  mncii  iaglier  liequeney  or  strength  of  (..'ic  mail;;..  clan.M'  sui;  uirc  over  the  icduced 
relative  structure  (in  efteci,  evoking  the  (hmc'uica!  .Senioid  siralegy).  riiere  are  Ihtcc 
potential  prol'lenis  with  such  explanations,  the  !ii  si  is  meiliodcM  '.’.ical,  the  scs  >  '!k!  and  third 
empirical : 

I  It  IS  tlilficiill  to  make  am.  predictions  v.  iih  the  tlico;  v,  u  ilhoui  actnalK  i  ihianmi..,:  .soir.. 
ticqi.u.‘nc'v  coiiiils  ol  ss  iilaclic  stuicUircs  ni  ii.iliiiallv  ol.'cui i  mt.'  texts,  i  iu  liic  nioic. 
i'\  cn  if  sncli  coiiiUs  c  onid  he  ohi. lined  .iiul  loimd  lo  he  (.  niisisteitl  ith  sons'  ( IP  eilects, 
It  is  pos.MhIe  (he  nndcilxiiig  cause  el  g.edc:i  paths  could  '’.'li  Im  niissed  Reduced 
id.il  i  'xc  sti  iiclui  cs  m.ic  he  k  ss  I  leqin  lit  m  pai  i  /’C<  on.vc  I'l-.'x  Ic.id  !<  .i  g.ii  dcii  pal!  is.  itoi 
llic  c  lhci  w,i\  aioiiiid  .Sslilcsiiigi-i  j  i9(>S'.  notes  this  po  AiCn!  uilh  lie.jiimH.  x  I'as-  al 
ihcoi  u's  ol  coiiipichepsu'ii  ditlici.lix 

It  sc'c  ms  mil  I  kel  \  I  ii.i.  ce'U  -  n  nel  ion  li  v'qiiencic'- w  'i  i  .icce.um  t  oi  ihc  ;  am  e  i  u  el  tec  I-,  m 
l.il'i  's  .’.a  2  !ii  I  ,ti  ex.u  ipk.-.  II  e  doul'lld  lii.i!  Ilieie  me  .no  siymk.  mi  In aiaencx 


dliv'x". 

i>u(M  urn  t  iaiisk 

■  ,nid  I 

uiali\  v‘  V  I  u:sr  v  t  mi  -.Ii  At  i 

I V  M  I  •  [I 

ipi  m 

|llcil',  \ 

slis  imlli  itiei  ,i  ics  in.o  li.i'.  i- 

.hif.m 

il  -  .iiaauintiim  t>'i  ilm 

'  1  ( / 1'  -  ( '  < 

/ 1  1  Md  < 

/;  f\ 

ol  (  i!  ’  n  i  Isa'i  s  I  ii  'Itsl  m  i;  I  I  i  tu'-.v'  ■  >!ii non  nai.i  den  lem.i  i  .lies  '  I  ml  n.n  del ,  ]  '.d  i  ,  . , . . 
I iide I 'm ideni  oi  vx  ii.iie \  r i  i a  i a 'i isid.a ca !  lo  I h'  il m  si i  oi n'em  , , , n  ei  j  i,  .■  i e n  ed  1 1  nm ii i  e 


2.5  Pursin':  hrcakdowi/  ana  acceptable  enweddungs 


49 


A  further  chaUerige  for  those  theories  i-'  pieseoted  by  the  contparaliveiy  ovcrwhclmitig 
succe^  s  of  sti  ucturally-based  models  which  have  no  f  ole  for  relative  frequencies  or  strei>gihs 
re.g.,  the  Cn-hne  e.r>ca.!!ty  Constiaint  discussed  above). 


Constrained  para.*i!2l  inodeis 

Constrained  parallel  nrodels  (or  limited  path  models)  derive  their  G*^/U?A  predictions  from 
constraints  that  limit  the  structural  interpretations  tfiat  may  be  carried  along  in  parallel  If  the 
disambiguating  material  is  reached  and  the  required  stuicture  is  still  available,  no  garden 
path  effect  arises.  If  the  disamhiguaiing  material  is  reached  and  tne  required  structure 
has  been  pruned  for  some  reason,  a  gat. lei;  pat?'!  effect  does  arise  and  the  parse  fails. 
The  constraints  that  limit  pa;alle]  structures  play  Hie  same  theoretical  role  that  reanal>sis 
constraims  play  in  single  paih  niodels. 

CC  RJR,M)£r<.  (Just  &  Carpenter,  1 792)  is  a  paradigmatic  example  of  a  constrained  parallel 
model.  In  CC  READEP,  the  fixed  amount  of  avai!:iblc  activation  in  the  system  limits  the 
structure*'  that  may  be  maintai.*'ed  in  parallel.  The  model  embodies  directly  the  hypothesis 
that  GP  effects  emerge  because  of  working  memory  limitations.  To  actually  derive  pre¬ 
dictions  from  the  model  requires  specifying  how  structures  will  differentially  consume  the 
activation  .escurce  and  hov  the  system  responds  to  po^'^ntial  over.9owG.  V'.^hile  the  model 
could  in  principle  <tcrount  for  a  range  of  effects,  these  necessary  specifications  have  not 
been  workea  cut  m  detail.  To  the  extent  that  they  are  s|>ecified,  the  predictions  depend 
oi.  different  frequencies  or  strengths  of  alternative  structures,  leading  to  ^he  difficulties 
described  in  the  section  above. 

The  parallel  model  in  (Gibson,  199  i )  docs  present  a  detailed  set  of  structural  metrics  and 
principles  for  pruning  interpretations,  deri'-'cei  primarily  from  GB  syntax.  These  principles, 
in  effect,  are  another  instantiation  of  the  Structural  Garden  Path  Hypothesis  (54).  While 
presenting  an  example  of  the  theory  at  w'ork  would  require  introducing  too  much  detail 
here,  the  important  fact  to  note  is  that  (his  model  shares  many  of  the  strengths  of  the  On-line 
Locality  Constraint  presented  eiriicr.  the  theory  is  applicable  to  any  structure  that  can 
be  given  a  GB  analysis,  and  it  accounts  for  a  w'lde  range  of  GP/UF’A  effects.  It  therefore 
clearly  establishes  that  detaded  accounts  of  garden  path  effecis  may  be  developed  within  the 
constrained  [laralle!  framework.  Alshough  loosely  nuilivated  by  working  memory  capacity 
constraints,  the  role  of  working  memory  is  not  as  clear  in  this  model  as  in  CC  RF- ADER. 


2.5  Parsjnj»  breakdown  and  acceptable  embeddings 

(  enter  enib dded  (or  seif  eii'hedded)  senterii  es  jcti  as  (57)  were  among  die  lust  con 
siriK  tioie.  studied  tn  psvehoi.igisis  arnl  iineuists  eoruierncd  wth  the  dtsiinetion  ihetween 
iingnisiie  conipcienee  and  perlorrnanee  (.Miiier  A  Cliomskc  1‘RVs,  Miller  A  Is.ird,  !96'1 ). 


1  '1  / !  [  lie  e.il  dial  ihe  birri  lha!  die  mouse  ehased  scared  ran  awav 


C'lhjptcr  'luiimv  Sentence  Coinptchcnsion.:  Phenowena  and  Ffc\  ions  Work 


(rctitcr-cribedtJed  consUuctions  are  iiue  resung  beeausc  <hev  are  notoriously  difficult  to  coin- 
preherid,  but  cannot  be  ruled  out  as  arigratvur.atical  vritbout  introducing  ad  hoc  restrictions 
into  liiC  grammai  to  p>'event  eivibeddiiigs  beyond  a  certain  level.  They  are  the  paradigmatic 
example  of  an  imucceptaklc  but  .gratn/w/Z/etj/ sentence  (Chomsky.  1965).  riuis,  most  psy- 
c.bo!ogisls  and  linguistics  alike  have  a.ssurned  that  there  must  be  a  psychological  rather  than 
•  mguistic  e.xplanation  of  the  difficulty  with  center-embeddings  (e.g..  (Miller  &  Chomsky, 
!96.‘C  De  Roeck  et  al.,  1982;). 

2,5.1  Farsuig  breakdown  detined 

The  unact.cptability  of  (57)  is  an  examp'e  of  the  general  phenomenon  of  parsing  breakdown 
(?B).  Parsing  breakdown  occurs  when  a  listerser  or  reader  is  unable  to  comprehend  and 
perceive  a  sentence  as  grarnrnati.'al  without  great  difficulty.  In  this  broad  sense,  parsing 
breakdown  includes  garden  path  effect.s,  but  1  will  generallv  use  the  term  to  refer  only  to 
breakdown  that  cannot  be  attributed  to  misinterpreting  Iocol,  ambiguities.  This  section  is 
concerned  vx  ith  iust  this  narrower  class  cf  break'^own  effects. 


Evidence  for  parsing  ?>reakdown 

Nearly  all  of  the  experimental  evidence  hearing  on  parsing  breakdown  involves  the  center- 
embedded  construction  introduced  .thove.  The  basic  finding  is  that  doubly-embedded  object 
relative  claiises.  as  in  (57),  cause  g;  at  difficulty.  This  has  been  demonstrated  in  a  number 
of  ways;  subjects  consistently  judge  ccntcr-embeddings  ungrammatical  (Bliimenthal,  1966; 
Marks,  1968),  and  perform  poorly  on  simple  verbatim  recall  tasks  (Miller  &  Isard,  1964; 
Foss  &  Cairns,  1970),  tuiiimed  paraphra.se  tasks  (Blumenthal,  1966;  Stolz,  1967;  Larkin  &. 
Burns,  1977),  and  questic.is  that  test  comprehension  (Blauberg  &  Braine,  1974).  In  most 
of  thc.se  experiments,  the  baseline  for  comparison  wtis  performance  on  right-branching 
sentences  such  as  (58),  which  carry  die  same  amount  of  content  in  a  different  syntactic 
structure. 


(58)  The  bird  chased  the  mouse  that  .scared  the  cat  that  ran  away. 

Performance  on  right  in.  nching  versions  of  ccmc!- embedded  sentences  was  always  better 
and  did  not  stiow  the  severe  dccicniont  at  two  levels  of  embeddings  (hat  the  center  embedded 
sentences  did. 

Because  jiarsifig  breakdown  dike  garden  path  effects)  can  be  rt'vealed  by  hngiustic 
accepi-ibility  juiigniciUs,  fingui.'.is  aiul  oliier  lesearchiMs  have  generated  a  wide,  raiint:  >0! 
unaci  ••■pSablc  tnil  (putatively)  erammatieai  structures.  Tables  ?.  1  I  and  !,(’  jiiescm  a 
lis:  launaiils  lioni  (lilison  t  1661  ),  v'.liich  Useji  diew  hi'as  ily  on  rv|),e!  ( 

.■M'iiougii  the  uiiaccepiabiiii'r  of  most  of  ihese  sentences  was  deternnned  In'  lufoi  nial  sui  ve  v 
and  d:i',i-)oic  not  suhiect  to  the  neois  of  ruullipie  esj'eruaenis  aeij  {miluisie  '.-xpei  inn  utai 
l>ai  adigi'i"-.  ;t  ae vci  tficless  eonvinciiu’lv  demonstrates  th.il  liie  [.'iierx itnerion  of  pais.mg 
hic .ikdovM,!  cxteiiu-,  ticvond  ju>,i  o|i|ert  re!.a,i\s‘  ■  .,‘nici  cint'eddiiii's  ))i\t  .is  I.tu.  gaidcii  isitit 
i-  Peel  e  .;ersjs  hes  orui  ihe  t  .uionu'al  main  s etls/foiluced  relative  conNiujC a .-n 


2.  5  Parsing-  hrcukdom'  ,knd  acceptable  cinbcddscgei 


5! 


rABU'  2.1 1 :  A  collection  of  construction,s  caii  jing  parsing  breakdown  (part  )  of  2). 


Type 


Example 


PB 1  Cenler-enibeddcd  object  relative 

(Miller  El  Chomsky,  1953;  Miller  <& 
Isard,  1964) 

PP2  Center-embedded  coject-relati''o, 
dropped  complementizers 

PB3  Center-enibedded  subject-  relative  in 
Wh-question 
(Gibsor,  199  It 

PB4  Center-embedded  subject-relative 
(Gibson,  1991) 

PB5  Embedded  subject  sentencvc 
(Kimball,  1973) 

FB6  Relative  clause  with  embedded 
subject  sentence 
(Gibson,  1991' 

PB7  P'ost  verbal  relative  clause  wdh 
cir  bedded  subjcci  ,scnte-ncc 
(  Gibsoti,  '!  991 ) 

P)38  Suli'cc!  scnteLcc  onibcdc.cd  in 
Sc  ntentia!  comp'  .iKeiit 
I'tPhson,  1991) 

!’l.'  )  Eii'hi'dded  Si'mcfilia!  v.A)inp!e:ncnt 
tGibvmi.  i  9'd  ) 


The  ,:nan  that  the  woman  that  the  dog  bit  likes  eats 
fish. 

■j.'he  man  the  woman  the  dog  bit  likes  eats  fish. 

Who  dvi  John  donate  the  liirnii.ure  that  the  repair¬ 
man  that  the  dog  bit  found  to? 

The  man  that  the  woman  that  won  the  race  likes 
eats  fish. 

That  that  Joe  left  bothered  Susan  surprised  Max. 

The  woman  that  for  John  to  sn*oke  w'ould  annoy 
woiks  in  *his  cTice 

The  compiuiy  hired  the  woman  that  for  John  to 
smoke  would  annoy. 

Mary’s  bclict  that  for  John  to  .smoke  would  be 
annoyiiig  is  apparent  due  to  her  expression 

.k.-hn's  'uspicion  -iial  a  rumor  that  ‘he  electron 
had  not  been  nin  farr.ly  was  iriie  moiwuled  him  to 
mvestigaie  ii  rthei 
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Ta  Bl  E  2. 1 2;  A  co'lectioii  of  constructions  cau.‘:ing  parsing  br'-akdown  (part  2  of  2). 


Tvte 


Example 


PBIO 

.jeniential  compiement  embedded  in 
relative  clause 
(Gibson,  1991) 

The  man  who  the  possibi  .ty  that  students  are  d<in- 
geroi’s  frightens  is  nice. 

PBll 

Wh-question  with  sentential 
complement  with  embedded  relative 
(Gibson,  1991) 

Who  does  the  information  that  the  weapons  that 
the  goveinnient  built  don’t  work  properly  affect 
most? 

PB12 

Cleft  with  modified  sentential 

complement 

(Gibson,  1991) 

It  is  the  enemy’s  defense  strategy  that  the  infor¬ 
mation  that  the  weapons  that  the  govenimeiU  built 
didn’t  work  properly  affected. 

PB!3 

Clefted  subject  sentence 
(Gibson,  1991) 

It  is  the  enemy’s  strategy  that  for  the  weapons  to 
work  would  affect. 

PR14 

Pseudo-cleft  with  modified  sentential 

compiement 

(Gibson,  199n 

What  the  infomsation  that  the  w'eapons  that  the 
government  buiil  didn’t  work  properly  affected 
was  the  enemy’s  defense  strategy. 

PB15 

Fseudo-cieft  with  subject  sentence 
(Gibson,  199)) 

■What  for  the  weapons  to  work  properly  would 
affect  is  the  enemy’s  defense  strategy. 

P&!6 

'f'hough-preposinj.'  with  n.cditied 
senlcritial  cornpienienf 
(Git;.>.on,  1991 ) 

Suiprisuig  though  the  inforrnu!.ion  that  the 
weapons  that  the  gevernmen.  built  didn’!  work 
properly  was,  no  one  took  advantage  of  the 
(pistakes. 

FBI  / 

riiough  •prepos.ng  '.-viih  subject 

sfuicnce 

tvjibsoi., 

■Surjvrising  ihough  tor  the  wcajtons  to  wia.k  piop- 
eriy  would  be  lor  tht;  general  populaee,  F  wooUi 
not  surprise  some  military  offiinals. 

11^  jmOWMWLUUMi 


hnuikLU'w:)  :ind  ^iccsptdhlc  cn}hcdoitK:^> 


2/£.2  Ai  cefiL‘-*.b?.e  tinhi'fkfrags 

Recuf'sior.  or  eiTibedding  itself  is  not  nca;ssai  ily  problematic;  nqiii  cranching  rtructrircs  cao 
beombed  ied  dceoly  without  causing  paising  breakdown.,  as  p58)  iHusb'ate.s.  The  ciiffercnce 
between  liie  'wo  sijiicmres  .is  g;''e!i  schematically  below'; 

:,>b)  Rif^ht-hr:inch:n<:,  !a  ••Ici-  di 
(601  Center-i^mi:>edairig:  ).,.| 

Recursion  is  ''cerptable  m  (60)  (/iily  to  one  level,  and  acceptable  in  (5b)  to  any  level.  Right- 
branching  '.a’uctures  are  just  one  c-i  acceptable  eiTibedding,  Tor  cKamitlc,  Cowper 
( i  9  /6)  o! esents  the  following  fairly  cotnple.'i  structure  involving  a  subject  sentence  wdlh  an 
embedded  opjet' t  - r'c lati ve; 


(61 )  That  the  food  mat  John  (..Meted  tasted  good  pleased  him. 

Ttjoles  /.  1,3-2. 15  oreren'  a  lange  of  such  acceptable  sbactures.  Thrjugh  urjt  all  of 
the  structure:;  actually  tnvoivc  ru.r  ,piei;n.hcddings,  they  all  serve  as  useful  constraints  on 
thec'--  ics  'cf  parsing  breakdown.  I  will  continue  to  refer  to  f  tv-;  as  acceptable  embeddings 
(AH). 


2-^,2"  gtkivral  parsing  brea-kclown  pkexjojr^sna 

■in  additioti  to  lit.. dig  the  relevant  strrem'-e;;,  vve  cm  tbdract  a  few  impoitant  facts  about 
ivrrdng  b'  cai: down  iron  tht  enipir'ca!  ctuaies itad  me  c  rllecnon  of  .slruc,  ures  in  Tables  2. 1 1- 
d.I.i. 


i.  ^nde(;i^fh'Jc\c:t  o)  ai:ih:~uihy.  Aidiough  Trc  .'ehniticf'  of  the  parsing  breakdown  c’ass 
given  above  excludes  ambigu.ty  effect.s,  it  is  nevcuhcless  irnp'vrtant  empirical  fact 
that  ImcI  am''>igi.ny  is  not  necessary  foi  parsing  bieakclown  to  occur.  In  particular, 
this  cannot  he  tK  cr.pfinaiion  foi  difhculty  o.i  rhe  centor-embedded  con.structio’.'s,  it 
is  possmie  to  makv*  Ihe.st  tonstiuc’ioris  locally  ambiguous  by  dropping  the  compic 
mrnti^Mrs' 

i.'il,  Fi:':)  The  cat  the  iTio''  the  r'oi:.sc  chased  scared  ran  away. 

Friv'  ca!  tfie  bov.l  the  mouse  and  'he  doy,  ra.a  away. 

(I>!,  FH! '  The  cat  that  die  bud  that  the  motew  ciaisci  scared  lun  away. 

Scifi  ‘iKC  (63)  IS  locally  ambiguous  hc'ween  a  si'iug  o;  reduced  u  iatives  cud  a 
ei.vrip.r-’,':'!  .'loun  j  hi.:,i.sc  (63).  Cut  the  unacccplaluhty  o!  tin.  v  wLT.siruclh..i  [iciMsts  cvci! 
w  id;  die  ■..(  ihtt  overt  c.onii.'leiswnti/ers  ((v' )  (Blimicnlnal ,  I  ‘/6();  Claut.crg  tk 

thaine,  ibT*  F  oes  A  f ‘a'lris,  1  b/(!;  [.urkm  A’  Hurnc,  i'F/V  Mark^,,  ; ‘,'08  Mdie: ok 
r,,i>  d  1  '.>(■■!  !.  Some  ;'.’'idics  have  shown  thaf  '^0  iicuiics  seeh  as  '  02;  .ne  /m  er  dil  tk  nll 
ilem  ((>4)  under  c'.'Oa):'  ir!c:i.  iircs  it  odo;  ik  Ciarreu.  Hakes  <k  lats  ,,  ‘4/i),,  bill 

liie  uiiaceiy.Sabdov  ol4'64.i  leinains  (itT,!lywwi.iiiioir.'d 


Husnan  Scriicrcc  Coiuprtihfn^.-ior,:  Phenomena  and  Previous  Work 


^4  2. 


Tm.ble  2.13;  A  collection  of  acceptable  entbedded  structures  (part  1  of  3). 


T>'t’F;  Example 


.AEl  Right  branching 

(Miller  &  Chora, ‘.ky,  1963;  Kimball, 
1973) 

AE2  Left  branching 
(Kimball,  1973) 

.M;3  Single  ieiative  chiuse 

AEa  Wh-(nit,stic:i  with  relative  clause 
subject 

(Gibson ,  1991) 

/iE5  .Rosi-verbai  c  enti'r-embedded 
subjecC-reladve 
(Eady  &  Fodor,  1981) 

AE6  Post-verbai  center-embedded 
^'-bieci-relative 
(Eady  F.:)dor,  1981) 

.AL7  Pest-dative- verba!  center-embedded 
subject-relative 
(b'a  Jy  &  Fodor,  198!) 

.aLH  Subject  .sentence 
(Kimball,  1973) 

\F9  Subject  sentence  with  embedded 
rekitive  clause 
(Cow  per,  1976) 


The  dog  saw'  the  cat  v/hich  chased  the  mouse  into 
the  house  that  Jack  built. 

My  cousin’s  aunt’s  dog’s  tail  fell  off. 

The  man  that  Mary  likes  eats  fish. 

What  did  the  man  that  Mary  likes  eat? 


1  saw  the  man  that  the  woman  that  won  the  race 
likes. 

I  saw  the  man  that  the  woman  that  the  dog  bit 
likes. 

John  donated  the  furniture  that  the  repairman  that 
the  dog  bii  found  in  the  basement  to  charity. 

Thai  Joe  left  bothered  Susan. 

I'hat  the  food  dial  Jofui  ordered  tasted  good 
pleased  turn. 
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2.S.  Parsing  breakdown  and  acceptable  entbeddings 


Table  2.14:  A  collection  of  acceptable  embedded  structures  (part  2  of  3). 


Example 


Type 


AEIO  Topicalization  followed  by  subject- 
relative 

AEll  Fronted  clause  followed  by  subject- 
relative 

AE12  Nominalized  embedded  subject 
sentence 
(Kimball,  1973) 

AE13  Post- verbal  untensed  subject  sentence 
(Gibson,  199!) 

AE14  F’ost- verbal  untensed  subject  sentence 
embedded  in  sentential  complement 
(Gibson,  1991) 

AE 1 5  Sentential  complement  with 
embedded  subject-relative 
(Cowper,  1976) 

AE16  Sentential  complement  with 
embedded  object-relative 
(Cowper,  1976) 

At;i7  \Vh-(iiii‘s(ion  willi  seiilcnliai 
coniplemenl 
((.iibson,  1991 ) 


John,  the  boy  th.at  the  dog  bit  likes. 

Whi  e  Mary  slept,  a  sock  that  the  dog  chewed  fell 
on  the  floor. 

Joe’s  leaving  bothering  Susan  surprised  Max. 


I  believe  that  for  John  to  smoke  would  annoy  me. 

Mary  held  the  belief  that  for  John  to  smoke  would 
be  annoying. 

The  report  that  the  armed  forces  that  anived  first 
would  have  to  stay  for  another  year  surprised  me. 

■J'he  thought  that  the  man  that  John  liked  saw  the 
dog  scared  me. 

Who  did  the  information  that  Iraq  invaded  Kuwait 
affe:l  most? 
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Table  2. 15:  A  collection  of  acceptable  embedded  structures  (part  3  of  3), 


Type 


Example 


AE18  Post- verbal  relative  clause  embedded 
in  sentential  complement 
(Gibson,  1991} 

AE19  Post-verbal  doubly-embedded  senten¬ 
tial  complement 

AE20  Cleft  with  embedded  relative  clause 
(Gibson,  1991) 

AE21  Cleft  with  sentential  complement 
(Gibson,  1991) 

AE22  Pseudo-cleft  with  embedded  relative 
(Gibson,  1991) 

AE23  Pseudo  cleft  with  sentential 
complement 
(Gibson,  1991) 

AE24  Though-preposing  with  embedded 
relative 

(Gibson,  1991) 

AE2.S  'riiough-pieposiiig  with  sentential 
complement 
(Gibson,  !99l ) 

Pied  piping 

(Pickering  &  Barry,  1991) 


Tlie  pentagon  employs  many  bureaucrats  who  the 
infonnation  that  Iraq  invaded  Kuwait  affected. 

The  professor  did  not  believe  my  claim  that  the 
report  that  the  school  v/as  corrupt  was  biased. 

It  was  a  fish  that  the  man  that  Ellen  married  saw 
on  the  highway. 

It  was  the  Americans  that  the  information  that  Iraq 
invaded  Kuwait  affected  most. 

What  the  woman  that  John  married  likes  is  smoked 
salmon. 

What  the  mmor  that  the  accused  man  had  robbed 
a  bank  inlluenced  was  the  judge’s  decision. 

Intelligent  though  the  man  that  Ellen  manied  is, 
he  has  no  sense  of  humor. 

■Shocking  ti  ough  the  new  .  that  h  aq  invaded 
Kuwait  was,  even  worse  news  was  yet  to  come. 

.ichn  loimd  the  saucer  on  which  Mary  put  the  cup 
into  which  1  poured  the  tea 


AE26 


2.5.  Fursing  hrcukcinwn  and  ucccptiiblc  CfuhLxklings 
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2.  In.suj'jkicncy  of  embedding  depth.  Deep  embeddings  alone  do  not  necessarily  cause 
parsing  breakdown.  The  range  of  con.structions  in  'fables  2. 1 3-2. 1 5  make  this  clear''’ . 

3.  Fairly  sharp  drop  in  acceptaldlity.  There  is  a  rather  sharp  drop  in  acceptability  of 
center-embedded  structures  from  one  level  of  embedding  to  two.  Subjects  almost 
universally  judge  one-level  embeddings  to  be  grammatical  and  two-level  embed¬ 
dings  to  be  ungrammatical  (Biumenthal,  1966;  Marks,  1968),  and  performance  on 
paraphrase  tasks  drops  to  chance  levels  at  two  levels  of  embedding  (Larkin  &  Burns, 
1977). 

4.  Little  effect  of  explicit  instruction  and  training.  Subjects  continue  to  find  center- 
embeddings  difficult  after  explicit  instruction  and  training  on  the  structures  (Blauberg 

Braine,  1974),  in  contrast  to  most  garden  path  sentences.  Some  subjects  even 
continue  to  deny  that  the  structures  are  grammatical  (Marks,  1968).  One  interesting 
result  from  the  Blauberg  &  Braine  (1974)  study  is  that  subjects  were  able  to  increase 
their  ability  to  comprehend  center-embeddings  from  one  to  two  levels  of  embedding; 
performance  on  comprehension  tests  still  dropped  to  chance  at  three  embeddings. 

5.  Independence  of  length.  Long  S'mtences  do  not  neces.sarily  lead  to  breakdown 
(Schlesinger,  1968),  nor  do  short  sentences  guarantee  comprehensibility  (PB2). 

6.  Effect  of  semantic  content.  In  untirred  paraphrase  tasks,  performance  on  semantically 
supported  (SS)  center-embedded  sentences  is  betterthan  performance  on  semantically 
neutral  (SN)  center-embeddings  (Stolz,  1967).  Examples  of  each  type  are  given 
below: 

(65;  FBI)  (a)  The  bees  that  Tie  hives  that  the  farmer  b  iilt  housed  stung  the 
children  (SS). 

(b)  'fhe  chef  that  .lie  waiter  that  the  busboy  appreciated  teased  ad¬ 
mired  good  in  jsicians  (SN). 

7.  Independence  of  shott-term  item  oieniorv.  fhe  I  ark  m  &  Burns  (  1 9  77)  study  demon¬ 
strated  that  subjects  may  be  able  to  ivca’i  ilr.  v.  'uds  in  cenlei -emli  rlded  structures 
without  being  able  to  correctly  p  lir  the  wo-ds  (i  e,  correctly  parse  tlie  structure),  fhis 
shows  th;it  the  ability  to  coinpn  heiul  the  siriicture  i.s  at  least  partiallv  iiidependcnt  of 
short  term  memory  loi  tlie  woids  in  the  sentence. 


2.5.4  i’heorie.s  of  parsing;  hreaktlown 

r'leail'v  all  iheoi.ic's  ol  parsing  t'leakdown  assunii  that  sfiiictuiiv  likr  cimlL'i  ciiihiddeii 
K'latiws  aie  tliilictill  to  compK'tu  tui  Irccau-w  ol  miiik  iuail  on  .  umpiitalional  icsuiiic'.'^. 

'  ■  .'H  I '11 '..I :  ig  l\,  nilllll|>j\  ,sfli  r--inti;"iiilv'il  iiii'’ili:-.lii  sUiii  Inn's  iniu'  i  nil's  l')i'  .u'l  Sell  li'si  ii  s'ri 

(  I  U(,S  I  j  iiM  III',  lilt' ru.  r  iisiii;:  Iti'lucw  ii' \ls  suj'gi'sl  ill  s’ I  Iril  s;.'!)  ''iiiiicililir.n  ut  [i.u  c  iitli;'' n  ai  H'liMi.ks  (i'ltsi:l 
l>'.  i'aniii.is  ii.'l  I'.'i ''laiirM )  il"r-,  ii,  i|  iii  a  Ir.iC  !■.'  |'',-|>.i'|'Uiii?'.  el  an  s' i  aim  I  i  it  ii  .il  i!  \  f  li '-a.  i'\ ri ,  lie 

h  nnil  i  hai  in  ■  i  uik'  i.  .isr ,  -a i tin*,,  is  ,  laiini-a  that  iiiitsil.iri'  sat  i  .ii.ii  i lii't i  l '  'U'  sti  ii,  t  m  ,ii  1  ill  t"i  iiiail )  tr  1  .  w  t  n 
: '  I  .'1 1 1;  I  Ml  Is  .  !  - , !  >  1  I  ,1  i  ill  K:  iiru  li'.ii  tiew  ii  M  ii!i'!  j  s  at  t!ii.-;.i.  u-.ii  s  t s  w  li  'i  I  aspi'M  1 1 '  |  i.ii  s  i  iis ' 
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Table  2.16:  Structural  metrics  for  parsing  breakdown. 


Structural  metric 

Accefiability  limit 

Ratio  of  nodes  to  terminals 

Miller  &  Chomsky  (1963) 

unspecified 

Degree  of  self-embedding 

Miller  &  Chomsky  ( 1 963) 

unspecified 

Open  sentence  nodes  (top-down) 
(Kimball,  1973) 

2 

Local  nonterminal  count  (top-down) 
(Frazier,  1985) 

8  (interred) 

Unsatisfied  syntactic  requirements 
(Gibson,  1991) 

4 

(Marks  (.1968)  suggests  that  perhaps  the  structures  are  actually  ungrammatical.)  The 
remainder  of  this  section  explores  two  classes  of  parsing  breakdown  theories:  structural 
metrics  and  an  hitectural  theories. 

Structural  metrics 

Structural  metrics  are  theories  that  define  some  metric  over  syntactic  structures  which 
predicts  the  perceived  complexity  of  parsing  the  structure.  Many  of  the  theories  also  .specify 
a  limit  beyond  which  structures  str)uld  become  unacceptable.  The  theories  differ  in  the 
degree  to  which  the  metrics  are  motivated  by  some  underlying  computational  architecture, 
but  all  assume,  at  lea.st  implicitly,  that  such  grounding  in  an  architecture  could  eventually  be 
discovered.  To  the  extent  that  the  metrics  are  successful  in  predicting  parsing  breakdown 
they  can  potentially  help  to  guide  the  search  for  the  architectural  mechanisms.  Detailed 
empirical  crilujnes  ol  each  o(  the  stniclmal  metrics  may  be  lonad  ia  (Ciibson,  16dl); 
(herctore  the  discussion  here  will  be  kepi  brief, 

fabir*  2.ib  summarr/es  the  sirueuira!  metrics  For  each  ihcoiy,  the  piciposeti  Imgui  aie 
measuring  unit  is  idenlilied,  along  with  the  limit  lor  aeieptable  struclures  (il  s|iccilied) 
Sonic  u;  (he  nietnes  ojiei.ite  on  intci iiiedi.ile  [);.a,se  iieis  and  are  theceloi e  reiatixe  iu  ,i 
jaailicular  parsing  algorithm  istiategy  foi  eiiuineiating  nontenniinds );  ihcse  arc  nalea  whc'c 
lelevani. 

Mill/'/  iind  (  \  nu’lru'S 

Milk  r  A  t  'hi  I!  ii'T. \  (  I  On  i  i  pi  i  ,[)i  i-.ed  a  rmniin’i  ih  sir  n,  lural  iaunpltw  ii\  n.casi.f  as  i,i' jirJir  > 
ilcvn  r  c'f  ,a7/  amhi  ihlifiy ,  ,ii)!.!  tto  ir  u>  tnnitUdlivOc  !  h,  Ni'l!  eiiibi.  ikliii!'  sneiiu  anipK 
alali  s  th.it  iiii'i  e  iitapii  \  Si’l  t  inbi  i I- iiai  n'i  lu  tin  as  will  r.,.  ni,.:,,- ,  livlii  iil ,  u  >  .i  a j  ■ '  Liana  !  fa.- 
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tlieory  is  not  merely  descriptive  because  the  metric  derives  from  a  formal  analysis  that 
shows  that  self-embedded  (as  opposed  to  right-  or  left-branching)  structures  are  precisely 
those  structures  that  eventually  cause  trouble  for  any  finite  model  of  language  performance. 
Chomsky  and  Miller  stopped  short  of  specifying  any  concrete  bounds  on  self-embedding. 

The  node-to-termina!  metric  is  the  ratio  of  non-terminal  nodes  (or  total  number  of 
nodes)  to  terminal  nodes  in  the  oarse  t''ce.  This  ratio  provides  an  estimate  of  the  amount  of 
computation  required  per  terminal  node.  As  a  predictor  of  processing  breakdown,  the  node- 
to-terminai  ratio  fails  to  draw  the  correct  contrasts  between  difficult  center-embeddings 
and  other  acceptable  embeddings  (Gibson,  1991).  This  is  not  too  surprising  because 
Chomsky  and  Miller  clearly  did  not  intend  for  the  metric  to  account  for  difficulty  on  center- 
embeddings,  since  they  had  pi'oposedrhe  independent  self-embedding  metric.  Nevertheles.s, 
nearly  all  subsequent  theories  of  processing  breakdown  hav'c  adopted  some  form  of  either 
the  finite  state  explanation  or  tire  non-terininal/terminal  ratio. 


Principle  of  Two  Sentence.s 

Kimball  0973)  proposed  that  human  parsing  proceeds  top  down,  and  no  more  than  two 
sentence  nodes  can  be  parsed  at  the  s.ame  time.  This  rules  out  doubly-embedded  relatives 
like  (57)  since  three  S  nodes  must  be  open  (the  main  clause  and  the  two  subordinate 
clauses).  This  pi  inciple  accounts  ‘'or  the  iinacceptability  of  a  number  of  other  constructions 
as  well,  such  as  embedded  subject  .sentences  (PL'.5).  Surprisingly,  however,  the  principle; 
ovcrpredicts  pai.sing  breakdown,  as  (ol )  above  uc’-moiistrates  ( AE9).  Upon  encountering  the 
relative  clause  that  John  .ordered,  a  top-down  parser  has  three  open  S  nodes;  tire  main  clause, 
die  suhjec  (  sentence  (that  the  food  tasted  good),  and  the  en-beddeu  rclaiivc.  Nevertheless, 
the  sent!  ncc  does  not  produce  the  breakdown  associated  with  dotihly-cmbedded  object 
rclaiives. 


Maximal  laca!  nantern.nud  eamit 

I  r.i/icr  (  1 9,S;')  i  nn  dilieil  <  'liumsky  and  Miller's  oi  leiii.il  node-  lo-tenniual  nietnc  so  tliUl  >!  is 
,1  iocal  rathei  lliau  gloicd  measiin  ,  in  an  allenipl  lo  heller  eapiii.e  iiueiieiil-bv  inoiiieui  pio 
eessiiK’  diflieull',,  A  ioeal  ;io!itci inma!  eoiint  is  the  eriunt  oi'  nonleniimai  nodes  inlioriueed 
'vliile  parsin;’  a  siioii  segment  -of  the  iii[iiit  sire.iin-  i  i .u'lci  tlefiii'  ri  a,  '-lioM  seLMiicnt  as  'liree 
.id|.!eeiil  lerniiiails.  The  niaviinai  Ioeal  nonternimai  eoiinl  is  the  hiisa'sl  leeai  iionlemiin  li 
1.  oi'ia  that  oeeiii  s  dm  iiig  a  sente  nee  1  la/iei  assuineil  th;il  S  node  eeainied  ..s  I  S  while  all 
tlii'i  !i; >ii!e!  iiiin.ils  euiniteil  i  !  tie  j’lreihelioii  i>  thai  sen'emees  w  nli  tiie!!  hie.ii  nonlei  inin.il 
1.  oum  ,  wi'l  he  niun;  diliieiill  lopioiess  'dian  senieiu  es  ith  lew  i  oim''.  I'inue  ’  1  slun'.s 
I  .\  II  e  ..ifiij'les  of  how  Itn^  ineli  a;  ts  eoM,[)ute-l 

I  i,;/h-i  ex.impK-.  ,1  miniliei  o!  siih|iel  -.eiilenee  and  eealm  eiiihrdd' d  eonsli  m  t  nei'.  lo 
';!ji|'0!i  iIk'  iiuoie  I  his  I-,  tnsi  iik  li  le  w  ineh  eoi  lOi,  i!\  I'aedieis  die  i, mill. t\l  helween 

enie !  emi  ualdei  In  I'.'-.  i:i  i  ih|ei.  I  posii  a  ui  \  -.lil'iee'!  |  x  isil  le.n  i  1  I-;  I  \  , \l  i .  1 1  ,id\  >V  I  out  u . 

CIS  I  )  )  I  l-.w'  e\  ei ,  ,r-  <  ■  ih-.eii  ,  1 ' 'O  I  i  |'n  imn.  ont ,  ilk'  me!  i  le  I  .;i  h.  i.i  ,!e.mii!|  i  oi  die  I  mss 

I  iiduM'  di.il  doiihl  \  eii  itakiue'd  i  el .11 1  \'e  .u  e  mo'e  di  I  Ik  all  lli.i!  '■  as  els  t  mlk'dds'ui  i  l'I.C  r.  e'. 
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Figure  2.4:  Compuang  maximal  local  nonterminal  counts.  The  locality  of  the  metric  is  three 
adjacent  terminal  nodes  S  nodes  count  as  1.5,  all  other  non-terminal  nodes  count  as  1. 


(66)  (a)  The  man  that  the  dog  bit  ate  the  cake  that  the  woman  saw. 

(b)  The  man  that  the  woman  that  the  dog  bit  saw  ate  the  cake. 

Frazier’s  metric  incorrectly  assigns  the  same  value  to  (66a)  and  (66b).  The  fundamental 
problem  is  that  the  metric  predicts  difficulty  only  wher,  there  is  a  high  density  of  nonterminals 
introduced  over  a  short  span;  specifically,  when  three  high  complexity  words  (m  the  sense 
that  they  produce  c  high  nonterminal  count)  are  immediately  adjacent.  This  may  be  the  case 
for  center-embedded  structures  with  dropped  complemenozers  (PB2),  of  the  kind  Frazier 
examined,  but  parsing  breakdown  may  arise  even  when  the  nonterminals  are  somewhat 
more  evenly  distributed  across  the  sentence,  as  in  (66a) 


Gibson ’s  overload  metric 

Gibson  (1691)  developed  a  detailed  metric  within  the  GB  framewurk  tliai  attributes  a 
cumulative  cosi  to  maintaining  structure  with  locally  onsatislied  syntactic  requirements.  In 
particular,  Ihc  tiicory  a  si'-iia  a  cost  to  maintaining  thematic  role  bearing  elements  which 
have  not  yet  received  then  tlieinaiic  roles,  and  to  Icxiciil  pru|ections  wliich  do  not  yet  have 
then  lexical  requirements  satisiied  {Of)  shows  the  stnicluu’  [miiluced  hv  tiihson’s  paisci 
ii[)on  irrocessiiig  the  third  NP  in  a  center  embedded  construction; 

(tV/;  mu’)  (a.)  riie  m.iii  the  worn, hi  the  dv)g  hit  likes  eats  ti:-,h. 

(hi  l/rlv/'tlvniiut,  h/dv,- (. (live  llie  woman,  j,v  I V  EMI//'  I  ■./  t/ic  ,/,.-g  !  !|  1 1 

I  hcie  arc  ii\  s‘  Nl’s  i  linee  lexical,  twi?  nonl-.‘Mcal!  that  icijune  theniati'.;  roles  iviU  lack 
lluan  file  llicoax'  elates  ihai,  iive  su(.li  iiKat  vi.'iiitK'n^  inhich  in.u  i;i\o|\e  sMiiailic 
i(S)Uu  eiiH'iils  otiiei  itum  ihein.ilu  rok,'  assiginuenl )  is  eiiongb  'o  t.  .aiise  hieakihui  ii, 

'A  lull'  ieiii  IS  .lA.ci'i't.ilde  <  iihsoii  dcinonsti ates  ih.ii  die  mcioe  .S  i  oniiis  loi  nearb  .nl  iH  i  ie 
[Misiiie  bieakuowii  elte'.Is  ni  l.ihles  ,’,1  |  2  1,'  I'lns  'U  as  .i  ni.iU'ii  eiiquiK'.il  ad\, iiue  ou  , 
[Me\ ii  s  is  ihtM!  i;,s.  vshivli  were  laiiii.uiK  I'aneei  ued  oiils  wi'lme-ilei  emhedvliiij's 
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Table  2.17'  Architectural  theories  or  parsing  breakdown. 


Architecture 

Limited  resource 

Proposed  limit 

Push-down  automaton 
(Yngve,  1960) 

Stack  cells 

7 

Subroutine  architecture 
(Miller  &  Isard,  1964) 

Return  addre.-s  memory 

1 

ACT 

(Anderson  etal.,  1977) 

Comro!  variables 

1 

Sausage  Machine 
(Frazier  &  Fodor,  1978) 

Lookahead  window 

6  words 

YAP  finite  state  machine 
(Church,  1980) 

Stack  cells 

unspecified 

Register-vector  FSM 
(Blank,  1989) 

State  for  tracking  clause  level 

3 

Unification  Space  architecture 
(Kempen  &  Vos.se,  1989) 

unspecified 

unspeciticd 

PI)P  network 

(Weckciiy  &  HIman,  1992) 

Hidden  units 

unspecified 

Comprchcui.isoii/paising  architectures 


Archilectina;  dici.  '  I  's  (.feline  !lie  ■.•(.nc  ilioiis  lor  [)arsinL’  hrcak(.lown  in  icmis  u!  some  spccilit' 
c(.)nipntat]OiKtl  aivtmeeiure  Unlike  ti\e  sirnclurai  metrics,  tlie  relationship  to  an  architecture 
IS  niht.'ren!  1\'  speciiieil  as  p.iri  ot  tlie  theory  vV'inle  ihis  is  a  clear  iheoretical  aihantage  o^'cr 
the  sliucluial  metiics,  no  arr'l'iile(.tural  throrv  vi’t  proposed  eonu's  close  to  Ihr‘  eo\e,age  of 
C iibsoii’s  metric. 

l  ahle  I  -  SLiminaM/es  the  architcetiaal  dieoi  ics,  hoi  e..ich  I'leoi  \,  the  i  .d  o  aiii  C(  aiipu 
lalional  resource  oi  mei  hitmsm  is  uleiitihed.  aionp  itli  the  pioposed  liinil  on  that  lesouice 
I  it  speeitievi) 


)>/!M  c  di'i’di  f’li':!  h' 

Viuoi's  liiodel  w.ts  the  lirsi  aii'Mnpl  i  >  dexelo;'  ,i  will  smsiiied  ei'n.'r'.ilatn 

,ice.  iinil  'I  sii  I  Hi  ,il  Cl  iiui'ie',  il\  1  he  niode!  is  csseUii.iiK  .1  1  aish  di  )\v  n  .niii  <1  ii.iioi:  i  t'l  ).  \  i 

d I . i !  e e i  1  e 1 . 1 1  e s  ph i . e .e  '-.i ;  1 1, a i s  1  i *  t is s  in  a  n  >p  d ■  ■  \s  u  i*  1 1  ! i ■.  ee-  li i  ; i i : i ;e ;  x ! t  Ic  ‘ 1 1 e ! >  1  i  . i s 
>  1: !  e I  n.ii  I  aacndci!  .is  1  nu uiel  oi  si.a iiei!i.'e  p,  odiua ion,  i he  .ailv : i.uoi i  c .ui  ! se  nsisi  1 . a  ['.e  .11  le 
. ! s  xi.  e I !  !  I  ic  !  cs!  (  1,,- 1 11  ii  1  r  lie  \  e  m ipi  >scd  on  the  i ! lode  1  .'s  lo  1  n n  1 1  l!  u  ..I  .a  k  d;  - :  n  !  1  1 1 '  si  - ,  e  1 1 

.1  \  .il  iie  11  .oi  i  \  .lied  h\  M  1  llei  s  f  .snunis  1  hi'oi  \  oi  si  10 1 1  as  11 1  me  1  noi  x  1  \  !  d  1  .a  i  os( , . 
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There  are  a  number  of  empirical  problems  with  the  model,  the  most  serious  being  that 
it  predicts  unbounded  left-branching  (AE2)  to  be  as  unacceptable  as  center-embedding — a 
consequence  of  the  purely  top-down  algorithm  (Miller  &  Chomsky,  1963).  This  is  a  serious 
problem  because,  although  English  is  predomiiian'ly  right-branchi  ng,  there  exist  languages 
(e.g.,  Japanese)  that  are  predominantly  left-branching. 

Despite  the  empirical  problems,  Yngve’s  mode)  has  a  combination  of  theoretically 
desirable  features  that  many  later  models  do  not  share:  it  is  instantiated  n  a  well-defined 
computational  architecture,  it  makes  clear  and  preci.sc  predictions  on  any  given  structure, 
and  the  architectural  limitations  have  some  possible  grounding  in  independently  developed 
psychological  theory  (namely,  the  Miller  7+/-2  theory). 


Subroutine  architecture 


Miller  &  Isard  (1964)  suggested  'bat  embedded  c.!auses  may  be  processed  by  calling  a 
subroutine  for  parsing  clauses.  If  there  is  a  severe  limit  on  tlie  number  of  return  addresses 
that  may  be  stored,  then  the  processor  will  encounter  difficulty  with  self-embeddings,  if 
only  one  return  address  m;iy  be  s'ored,  then  this  would  ;iccounl  for  the  difficulty  on  doubly- 
cfiibedded  relatives.  form  of  this  hypothesis  shov.'cd  up  later  in  (Anderson  et  ai.,  1977) 
atid  (Blank,  1989),  di.scussed  be.!ow. 


Acr 


Anderson,  Kline  a  ml  I  ,c  vvis  i  1 977  file  velopci!  a  model  of  language  eompiehension  witliin  the 
A('T  theory  of  cognition.  Since  ACi  ’s  f.roeedural  componeni  is  a  production  system,  tins 
iei.]inrei.i  s[iecit ynig  the  set  ol  f>i  Hluei.ons  used  m  p.irsing  and  mterpietalion.  The  control  of 
the  jiai  sing  (iroiiuetions  is  accoinplished  via  a  set  o|  eoiitro!  variables  that  mainiain  the  state 
iieeessai  v  to  handle  multi, 'rle  eiub'aiiimgs.  Tiicse  v.iiialdaw  peinu!  coiiltol  to  he  leiuriied 
lo  inoeluetioiis  parsing  liig!”'r  level  (.  'aiises  aitiv  emhetidi'd  el.uises  arc  complete.  I  he 
niotlei  h.is  enough  i.ainirul  \  ai  ia!iles  to  haiulle  onlv  one  lex  el  ol  einheddmg,  so  hieakdoxx  n 
otvuis  xx'ilh  double  emheddiitr’'  The  iiioilrd  :s  esseiiliallv  an  iitNiaiiliat ion  ol  flie  (Milka 
A.:  Is. lid.  lupd)  ihexnx'  ot  suluontme  iiiteinqition,  fjiihouiuled  light  biaiH  tong  as  ui  (.sS)  is 
not  probiemalK'  sinee  control  neetl  iiexer  retnrii  to  the  mail'  el.uise  productions  .Andeison 
cl  ,il  ackni >x’.  ledi.’.ed  liiai  the  limit. ition  is  esseiilialix  ailntiai  v  bul  ('01111x11  out  ih.il  .an\ 
loii’i 'u/iJt'i!  mciimix  of  st.ile  woiiiu  h.oe  (o  fie  iiiaini.uited  in  ,\(  [  \  sxni.tnlic  netooik 
.Since  dlls  netxxoik  is  iioi  I'omplelelx  rehahic,  hie.ikdoxxn  xxoiiUl  exentuallx  occui 


/  ht  ,S  [  Ca  \  i  1 'r  '  \lih  hlfle' 

I  1 .1,'  le  [  A  1  Ol  ioi  ,  I  S'  s  )  .ii  ’  I  il  >n;e  1  h(-  dll  t  uailt  X  with  x'ciitei  ein!  >eikln  n,"-  lo  pi  o!  he  in  ^  i  h.i  1  ;  ■  ic 
I  'I  ’I '  li.e  !  I ;  j  'ei  1  01  n  11 II  >'  I  hr  inii  nil  pin  .i.'..il  sei'i  in-;i|  i;,;'  j  j  u;  \  .is--,  .iiiie  i  h.il  ll  ic  ii  i  n  i  .i  I  .id  i.c  cni 
iiiMM  |)iii.i-,cs  oi  ih;'  iciuiin.i!  xrih  phi.iM-s  xxiil  i  a'  me.  a  o'ci  i  \  i.iiis  j  i  r  li  d  .i-  luiihanrd 
I'll,  .I'-.r  1 1\  lilt  I  'I'i '  \\  let  ii  mu'-i  1 1  i.itr  ll  s  \i  I  ncim  I  le  dci  isi,  ais  ix  a!  lonl  .e.  i  co-  i  < '  die  i  ea  i 
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c>f  the  sentence  that  fail  outside  the  six-wo/’d  wiitocw,  ffejwtver,  there  are  difficult  center- 
erubeddiugs  tvhich  fall  completeJv  v/i»h}n  the  wunkvA’: 

(68;  PH3)  Women  men  gitls  lO've  meet  die 

Since,  the  PPP  is  able  to  generate  S  itOvle;;  on  iis  own.  (68)  should  not  cause  it  difficulty. 
,A.ctua!ly.  tht  theory  Fra/ticrarid  r-odorrrt'po.’-e  is  morecomplex  than  this,  since  they  suggest 
f.hitt  the  P?P  may  be  garden-p^athed  even  though  all  the  relevant  information  is  available  to 
ii  m  thf  vvi'.dov'.  This  par?  cf  the  theory  was  not  worked  out  in  much  detail  and  begins  to 
■tfiderr'inie  th.n  w'.ioie  approach  of  having  a  restricted  window. 

YAP  p.nite  rtute  machnie 

C.'httrch’s  !  1980)  YAP  is  an  impienten»ed  system  that  combines  the  determinism  hypothesis 
of  Mai', us  :  1980)  with  the  idea  that  the  human  parser  must  be  a  finite  state  machine  with 
a  tas-ber  limiteJ  amount  of  state.  The  architecture  is  es.scntially  that  of  PARSIFAL,  with 
the  exception  thar  she  stack  is  bounded.  Unlike  Yngve  (1960),  however.  Church  did  not 
'  eniure  to  propose  whaa  the  limit  on  the  stack  might  be,  except  to  note  that  it  must  be  fairly 
shallow  given  the  difficulty  on  center-embeddings. 


Rcf'ister  vector  machine 


Oictnk  's  ( 1989)  parser  (introduced  earlier)  maintained  state  iu  a  fixed-length  vector  of  three- 
vaii.cd  sy-uacii';  featuies  and  control  variables.  F’art  of  the  vector  is  devoted  to  keeping 
'rack  of  vrhu'  ctao.y^l  level  is  currently  being  parsed.  The  vector  only  has  enough  stale  to 
ir  .x  ts  three  levels  (meui  clause,  embedded  clause  1 ,  and  embedded  clause2),  so  it  is  unable  to 
{larse  inply  ceu  ci  embedded  relatives,  indefinite  right-  or  left-embedding  doe.s  not  invoke 
ciause  shifting  ar.d  the  parser  handies  these  easily.  As  specified,  there  seems  to  he  enough 
Slate  pane  eveu  difhcult  doubly  embedded  relatives.  If  the  state  vector  was  restricted  to 
beUer  maich  human  j'jerformant e,  the  arciiilecluic  would  make  the  same  predictions  (both 
correct  and  incorrect)  as  Kinibairs  ( I9(’3)  i’rinciple  ot  I  wo  -Sentences. 


Vnificiiliocs  Snui  t'  nn  hiU’Ctiirv 

Kfiupen  A’  X'osse  i  i'-/,H6)  tievelojH'il  and  implemeuted  ;i  noxcl  contnut;itior;al  arcinlecluie 
Iscscd  oil  aciiv-atinn  liecav  and  sinuilitcd  .innealmg.  IIk-  sYstimi  vcoiks  b\  itiiKonu: 

asiai'iu  segments  troni  a  levicon,  aik!  Ilien  ait.H'lnng,  the  segmenis  m  a  stiH'lu.stn.  [uoeess 
ih.ii  l;i\  )is  nioic  tnghlv  iicliv.ned  iiiHies.  i  he  segii'en''.  cons, Ovi  ol  luxies  linked  fi\  funciional 
sNiiiaiiic  lel, limns,  .iiiaehmcnf  ini  iii^  h\  nnO' ing  indunl"  I  nodes  .Siiue  uloaiion 
di.as'Ss  mou'  icicii!  noile.s  are  mon'  aefise  Hie  teiigrci ai  ii o!  the  .inncahne  (siocess 
IS  .1  !  li  ill. !  ion  id  the  tol.il  at  1 1  \  at  ion  m  me  s\  sttic  si  I  lie  or  occss  i.n  ad  iial  Is  '  \  <  '.■  i' s  IvcmpcM 
.(ini  ^  ossc  picsciil  u'siiiis  '.jiint  mg  itiai  (fie  svsteiii  icuscs  singis  cinl'etitied  .'chitiic  t  hiuscs 
siiiisut  oul  OK',  o!  die  lime  (>u!  paiscs  litfii  s  cnihcdtltsl  'cLiliscs  oiiK  .Jh'Sii 
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of  the  time.  The  corresponding  results  for  singly  and  doubly-embedded  right-branching 
structures  are  97%  and  82%,  respectively. 

Although  this  is  an  e.Ktremely  interesting  result,  Kempen  and  Vosse  offer  no  immediate 
explanation  for  the  performance  difference  between  the  two  structures;  it  is  unclear  whether 
this  is  truly  the  first  nonparametric  theory  of  center-embeddings,  or  w'hether  there  is  some 
variability  in  the  basic  architecture  which  could  lead  to  different  results. 


Weckerly  and  Elman 's  PDF  model 

Weckerly  &  Elrnan  ( 1 992)  constructed  a  connectionist  system  that  models  some  aspects  of 
human  performance  on  center-embeddedings.  As  the  network  is  given  words  one  at  a  time, 
it  encodes  the  content  of  the  words  and  the  surface  order,  in  a  bottom-up  parse.  Since  there 
is  a  fixed  amount  of  structure  devoted  to  encoding  this  stale,  the  information  in  the  network 
eventually  degrades  as  the  state  gets  large.  After  processing  the  three  initial  noun  phrases 
in  a  doubly  center-embedded  sentence,  the  ordering  information  is  lost  and  the  parse  fails. 
However,  the  lack  of  order  information  can  be  compensated  in  the  network  by  semantic 
constraints,  so  that  in  semantically  supported  sentences  (see  $2.5.3),  the  network  can  still 
manage  to  give  the  correct  output.  This  is  the  first  model  of  proce.ssing  center-embedded 
structures  that  begins  to  account  for  semantic  effects.  However,  it  remains  to  be  seen  how 
the  model  will  scale  to  handle  the  range  of  effects  listed  in  Tables  2.1 1  -2.15. 


Fiinite  state  and  self-embedding 

The  key  feature  that  mos'  of  these  models  have  in  common  (particularly  the  models  of 
yknder.sori.  Blank.  Chomsky  and  Miller,  Kimball,  Wanner,  Weckerly  and  Elman,  and  Yngve) 
IS  a  commitmeni  to  finite  stale  There  are  two  gtxxi  reasons  for  'his: 

!  Anv  fini'e  machine  will  eventually  fail  to  recogru/e  self  embedded  stru(:turc,^.  In 
other  words,  self -embedding  gram: oars  fall  outside  the  computational  scojx.*  of  finite 
automata;  equivalentl),  no  regular  grammar  is  ‘'-'f  ^..ibedding.  (The  proof  involves 
die  pumping  lemma:  see.  toi  example  (I-ewi.sty  f’apadimitriou,  1981  )) 

2  x'homksy  1  1064)  proved  that  it  is  ont\  seif  embedding  tiiat  causes  languages  to  fx- 
coniext  free  .More  p'eciselv.  it  /  isarontexl  free  language,  then  it  is  not  a  tiUiic-Matc 
language  if  and  onlv  if  all  ot  its  gramma.s  are  self  embedding 

liuis.  simpis  aiioptnig  a  tunic  state  ihikIcI  make',  pieciseh  ihr  rigtil  «.u!  \Mlh  rcsfX'C': 
ccntci  l■[^l^>cu'li!ug^  tiiuk  state  .nacinries  vvdl  al w a'>  s  c\ eiiiuaih  (ac!  K'cugm.!' Msme 
cciuc;  emlicddcd  sU  ucliiies.  vsluie  ai  die  same  time  thes  aic  in  |uhki|>Ic  ,',,p,i,b.lc  vg  vcs 
i'Cipi/i'h;  III!  lit!;,')  kinds  i  >!  stiiksuics  pKHiuccd  l>\  cofiicir  fiCc  erainmars  i-,-  g  uidct.nUc 
oglu  6,  ,iiii  inng  I'i  !c!l  hmiK  lung -.liUi.  iuie' i 
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2.6.  \a)ni',7.'jry  of  rttc  phenonteim 


2.6  Summary  of  the  psieoomeaiia 


This  section  biiefiy  sunii'^arizes  ttie  major  finciings  concerning  the  sentence  processing 
phenomena  discussc  in  this  chapter,  and  eioses  wiih  a  discussion  of  how  the  phenomena 
together  provide  great  mutual  constraint  for  any  comprehension  theory. 

The  produce  of  comprehension  Comprehension  produces  a  syntactic,  semantic  (in- 
ten.sionai),  and  referential  (ex*en.siorE<- '  representation.  There  are  functional  reasons  for 
all  three  representations  lire  functional  analysis  is  further  supported  by  actual  practice 
in  builaing  v'orking  systems.  Tne  existence  of  a  referential  representation  is  empirically 
supported  by  expei  :ment.i  that  demonstrate  the  confusability  of  referentially  and  inferen- 
tially  equivalenl  expressions.  These  same  experiments  show  that  in  some  cases  the  final 
memory  for  texi  hs  primarily  lefcrential  On  the  other  hand,  the  independence  of  a  semantic 
represercation  i,'  supported  by  experiments  demos  strating  that  in  some  ca.ses  memory  for 
text  may  be  pnmaniy  semantic.  The  nature  of  the  memory  is  influer.ceu  by  factors  such  as 
ihe  difficulty  m  constn-  :ting  a  mental  model.  A  mental  model  is  a  referential  representation 
that  rep.esep.LS  one  particular  situation  at  a  time,  and  maintains  a  direct  correspondence 
between  tne  elements  in  me  representation  and  the  elements  in  the  domain.  Evidence  for 
this  form  of  representation  conies  from  experiments  contrasting  models  with  more  powerful 
alternatives  (logic)  on  a  variefy  of  reasoning  tasks. 


Immediacy  of  interpretation  and  the  time  course  of  comprehension.  The  referential, 
.semantic,  and  syntactic  representations  are  computed  immediately  <uk1  incrementally  on  a 
word-by-  word  basis.  In  reading,  the  time  course  of  this  processing  ranges  from  50-  iOOO-f- 
ms  per  ^vord.  The  evidence  for  immediacy  comes  from  a  w'ealth  of  experiments  using 
speech  shadowing,  eye  movement,  and  cross-modal  priming  tecbnique.s.  There  have  been 
suggestions  that  some  kinds  of  syntactic  inforinalion  is  systematically  delayed  (.such  as  verb 
subcategory),  but  thus  far  the  evidence  weighs  in  favor  of  universal  .syntactic  immediacy. 
T  here  are  limits  on  the  immediacy  of  reference  resolution  and  mental  mrxlel  construct’f'n. 


Although  the  prtxresses  are  immediately  initiated,  completion  may  lx-  delayed  due  to  Ihe 
struciure  of  the  text  itself,  or  computationa.!  lim'tatioris  of  comprehension.  A  numlxir 
of  cxpennienis  have  provided  evidence  for  the  distinction  fvetween  automatic,  or  on-line 
riKKie)  construction,  and  cognitive,  or  deiilxirate  mode!  cxinstruclion.  The  depth  and  nature 
ol  prcxessing  dcf)end.>  oti  a  ounifxi  of  factors  such  as  time  avaiiabie  and  the  goal.s  in 
conspf  eheiKSion 

Snucrural  umhi^uity  resolution  Bo'h  on  iuie  anibigintv  resolution  and  final  jxefericd 
inter pretations  rnas  be  ndluenix'd  by  structural,  icKical,  sce  lantic.  and  contextual  factois 
No  singie  os'iiii. iplc  or  p-teferences.  di  clas,-.  ot  preferences,  itas  Ixen  found  lo  univetsaUv 
characUTi/e  arnbijni'iy  resolunon.  large  number  of  crnptrr:  ai  stedk-s  stnav  tfias  liosh 


naoduiar  .liisi  inteiac  tivv’  “ttccts  .uiav  aris-  itcio.ss  a  vs  idt*  varseiv  ol  cinU' xt,'-,  ssruc' 

tvpcs,  and  vxperunciital  pa!adigyf!^.  Of  tht  majoi  oafsirtc  pie  fere  nc  t  >  fH'svon  sCvi, 
soiiic  conihinaiion  os  Right  Associaiiou  and  kMcal  I'fClCienciO  fuis  hce>i  fv'und  to  be  she 
Kiosi  iiibus.!.  li!  stur'ies  ol  tiainrai  corpoia.  /Vhisough  \hete  is  some  ev  idei 


'»>(  IV  nnsitci 


afriO!ii;;  <>1  ssnu,  turai  [Oifallehsn:  in  the  parving  process,  ibe  pjiydosu  al  rcMilts  oi  MfSiie 
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Garden  path  cjfects  and  unproblematic  ambiguities.  GP  effect;',  sometimes  arise  'A-hcn 
,1  reader  or  Hsteiie.'  misinteiprels  a  local  ambiguity  and  cannot  easily  recover  the  correct 
interpreiation.  The  result  is  an  impression  of  ungrammaticality.  Tiiere  are  a  wide  variety  of 
structures  associated  with  GP  effects  The  evidence  comes  from  a  range  of  expei intents  and 
informal  survey;-,  using  grarnmaticaiity  judgment  tasks.  Complementing  the  GP  structures 
are  an  equally  rich  set  of  unproblematic  ambiguities,  which  do  not  cause  difficulty  no  matter 
which  interpretation  of  the  local  ambiguity  proves  correct.  GPeffectsare  recoverable  in  that 
aGP  sentence  may  be  perceived  as  grammatica.1  once  the  correct  interpretation  is  discovered. 
GP  effects  are  generally  independent  of  length  (though  .some  distance-to-disambignalion 
effects  have  been  detected),  lexical  ambiguryy,  semantic  content,  and  the  assumed  preferred 
interpretation  of  a  given  ambiguity  (i.e.,  GP  effects  may  be  bidirectional). 

Parsing  breaKdovm  and  ncceotable  embeddings.  Parsing  breakdown  occurs  vvhen  a 
listene;  or  reader  is  unable  to  comprehend  and  perceive  a  sentence  as  grammatical  v^  ithout 
great  Ji.^heuky.  Parsing  breakdown  technically  includes  GP  effects,  but  parsing  breakdown 
may  occur  independently  of  ambiguity.  Breaktiown  on  unainuiguous  center-embedded 
Structures  has  been  demonstrated  using  a  range  of  ri/easures,  including  grarnmaticaiity 
judgments,  reca'l  tasks,  paraphrase  tasks,  and  question  answering.  There  is  a  fauiy  sharp 
drop  in  acceptability  from  one  center-embedded  relative  cUusc  to  two.  .w  variety  of  struc¬ 
tures  causing  parsing  breakdown  have  been  discovered  (though  none  have  the  thorough 
empirical  backing  that  center-embeddings  do).  Complementing  the  ?B  structures  are  a  va¬ 
riety  of  acceptable  embeddings  such  as  right-branching,  which  may  ne  iterated  indefmhely. 
In  contrast  to  GP  effects,  instruction  and  practice  have  only  margii'al  'injiact  on  the  accept¬ 
ability  of  difficult  structures.  Semantically  constrained  0Utf  .‘  ia!  does  boost  perforrnaoce  on 
untimed  paraphrase  tasks.  PB  effects  are  independent  of  leugt.h,  and  also  ir  tiepemient  of 
short  term  memory  of  the  words  in  the  sentence. 

Figure  2.5  gives  a  directed  graph  surnmari./,ing  the  const,- aitiing  ()rof>ertie.s  of  the  phe¬ 
nomena.  'The  grapl)  should  be  interpreiea  as  follows:  .,¥  f  means  that  phenoniena  .V 
constrains  the  theoictical  e.\plaiiaEiou  of  phenomena  T.  Tiach  aic  v  explained  be''>w  (tlie 
arcs  arc  labeled  with  lower  case  letters). 


(a)  i  he  mechanisnis  explaining  garden  path  e'lfetl'.  must  iu.H  iic  mo  swa.l;  S;,a:  hey  l.iii 
t(  account  for  unpiobleriiatic  anthjguiiies.  l  ikev  i.se,  the  u  echamsnis  expi.uning 
utijiroblematic  ambiguities  mu-a  not  be  so  powclui  ihal  ihey  Iasi  to  :Cv:outtl  loi 
garden  path  elf  -els. 

( b)  In i mediae  s'  ol  iiilc;  pretatson  constraui:-,  the  expianaiiou  <d  usipioidt  'natc  .iinhigu:!  s:  - 
fi\  mlmg  out  i.eileiii  kitsds  ol  lOokahCuU  (js  (.H’ia’.  ed  cottimn-iiien'' 

u'  cli  I  he  piieie 'MKTa  mu  r'Sinuhhg  ainhqactv  iesidaonn  .■mwtoe:-  (  OV'I  }'\ 
ni  pail  o!  lor  i-Miiitine  the  imiial  UiU  ipo-uaiovi 

i  c  ;  Mod,.:' Is  sM  ai hi'i  V u it v  ivSvilntion  rnn-c  ix'  con  O'-tem  v,  iih  itiUordLs  v  o'  oa  ‘i  i  v 

Pi  I  o  i.‘ h  v  ou..!:ii.d  li'Xic.;f  ■-.mt.ani.a.  an,.! -..oruevi  i.J  xp  .-nsi  „■  ndi.o!',' 


. . Hi  itii—ii  n  iifliw  "mm.i  ih  ii  h  niigfara»B»fffgiwgo^n 
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Figure  7.5:  How  the  phenoiiiena  mutually  constrain  the  theory.  X  —  Y  means  that  phtmiamena  X 
constrains  the  thcotetical  explanation  or  phenomena  K  Bat  h  aic  is  explained  in  the  text. 


(g)  'I'he  iitechanisiiis  tor  handling  acceptable  embeddings  must  be  consistent  wii  h  imme¬ 
diacy  of  syntactic  parsing. 

(It)  The  mcchattr  ms  explaining  pacsing  hreaktiown  must  not  be  so  weak  that  they  fail 
io  acc  iuni  lor  tise  acccpfahle  enifieddmg's.  I.ikcvvisc,  t!ic  niechanisrns  that  handle 
the  acceptaiilc  cmbecidings  nmsi  not  be  so  powertul  that  they  fail  to  predict  parsing 
breakdown. 
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Chapter  3 

The  NL-Soar  Theory 


There  seems  no  way  to  enter  the  fray  with  a  little  theory  and/or 
a  highly  approximate  one.  To  do  so,  is  to  invite  the  v^rath  of  the 
linguistic  gods.  Full  b  town  from  the  brow  of  Zeus  or  nothing! 
—  Allen  Newell 

To  criticize  the  pages  on  language  that  follow  would  be  like 
shooting  fish  in  a  band. 

—  Angry  linguistic  god' 

HIS  CHAfTl?,R  DESCRIBES  THE  NL-SOAR  COMPREHENSION  MODEL  built  within  the 
I  Soar  architecture.  The  first  section  lays  the  necessary  foundation  by  examining 
-IL  the  nature  of  cognitive  architectures  generally,  rev  iewing  the  Soar  architecture,  and 
establishing  the  NL  Soar  approach  of  studying  language  and  architecture.  The  core  of  the 
chapier  describes  the  model  itself,  along  with  examples  illusirating  its  operation.  We  then 
step  back  and  explore  the  space  of  potential  Soar  comprehension  n  odels,  to  motivate  some 
of  liu  major  design  choices  in  the  current  NL  Soar.  "  .‘c  chapter  concludes  w  ith  a  summary 
o.f  I  lie  theory. 

3J  Preiiiiiinaries:  arciiitecturvs  and  Soar- 

iteeausc  tins  itiesis  purjiurts  to  [iresent  an  un  hilci  tunillx  ha.^td  ibeorv  ol  comorclu  riNion, 
It  IS  inipmtaiil  to  I'xpkuii  exartly  what  that  means  and  why  it  is  a  desirable  as[r<.‘et  o!  the 
flic  cxpLuiatiuii  that  tollowsean  be  taken  as  jsut  ot  tlie  ansuei  to  trie  ijiiestton 
Isn’t  .Soar  gist  a  progt animing  language  used  to  iruidcrnt  ui  NL.  Smti  '.'  \Vc  shall  sec  ibat  it  is 
I'ai  moic  til  111  that,  and  \\c  w  ill  take  uji  the  issue  again  in  ('baptci  ff  consulei iin.’,  tlicic  the 
bi  oadci  issue  i  >1  .So.ii  s  lole  in  tfie  theoi  v  and  its  eiiipti  le.il  cos  ei  ;ige 

Ik'ick  liixk  li'di  'k  if:' r.vv  ut 

f  \ii  1 N  1  't  ih I s  fit  Ml  .no  hascii  'Ml  an  untiin sh(\l  niaiuiM.  i  ij u  (h.n  I  ks  .is  v*  .  m  k  ti it  i  m i  \‘*  il li  ,M K- n  Nc“ vk  cl i  i n 
iIk  SiM  in  ;■  1  if  !  '■  '\n\  inisc  cju.  Minis  cm  cnT.'!  f lutf  remain  arc  ol  ct  >111  sc  enf  n  c!  \  inv  i cs;hMVsi  lal  >f  \ 
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Chiiptcr  3.  The  NL  Soar  Theory 


3.1.1  What  is  a  cognitive  architecture? 

A  cognitive  architecture  is  the  (relatively)  fixed  computational  structure  that  supports 
cognition  (Newell,  1990).  An  architecture  specifies  the  available  processing  primitives 
(operations  on  data),  memory  structures  (support  for  storing  and  retrieving  encoded  knowl¬ 
edge),  and  control  structure  (specification  of  how  processing  unfolds)  Architectures  are 
universal — they  are  just  those  computational  structures  that  admit  programs.  For  behavior 
to  emerge,  both  the  architecture  and  the  content  (the  program)  must  be  specified. 


The  central  role  of  architecture  in  cognitive  science 

The  central  tenet  of  cognitive  science  is  that  cognition  is  a  kind  of  computation.  If  cognition 
is  computation,  there  must  be  an  architecture  to  support  it.  In  this  view,  discovering  the 
nature,  of  menml  architecture  ;s  the  most  fundamental  question  in  cognitive  science,  for  a 
theory  of  mind  must  he  an  aichitecturai  theory  (Anderson,  1983;  Newell,  1990;  Pylyshyn, 
1984). 

This  has  significant  impact  on  how  we  construct  and  evaluate  cognitive  models.  If 
a  cognitive  model  is  to  make  the  strong  claim  that  the  processing  steps  in  the  model 
correspond  to  the  processing  steps  in  the  human,  then  the  model  must  incorporate  some 
assumptions  about  the  underlying  architecture,  because  it  is  the  architecture  that  defines  the 
nature  of  the  available  processing  primitives  (Pylyshyn,  1984).  Pylyshyn  calls  this  form  of 
correspondence  strong  equivalence. 

Making  explicit  architectural  assumptions  also  helps  to  clarify  what  carries  theoretical 
content  in  implemented  cognitive  models.  For  example,  if  a  d  uring  Machine  w  as  seriously 
proposed  as  a  theory  of  mental  architecture,  then  a  cognitive  model  could  be  constructed  for 
some  particular  task  by  developing  a  computer  .sy.stem  that  forms  a  virtual  lui  iiig  Maeliiue 
aichitectiire  and  tlien  programming  that  architecture  w'ith  tlie  program  arul  data  relevant 
to  the  task.  1  he  paiticular  implementation  of  the  d  uring  Machine-  -wlietiiei  li  is  coriiwl  iii 
I  .isp  oi  ( on  a  parallel  inaeliiiie  or  serial  j.irocessor  i.s  u  rclex  anl.  What  l^  ihcorciu  ai i v 
sigmhcant  is  the  architet  tuie  itself  and  ifie  eonteiil  posited  to  produce  the  task  hchavioi 


3.  .2  Mu  archilt  ctiiro 

1  Ins  S'  ctiiii!  p! ()\ idcN  ,1  fu  K  i  o\ei\  u'\s  lit  the  esseiili.ils  ol  iht'  .Soai  .nchiUa  [nic,  .Milumgli 
.Sis.if  S'. is  liisi  desenhed  as  an  .uliticial  i!iielli>’.c.ncc  ssstcmi  (l  auil,  Ncvsell  A  Koscnldooni, 
1  ‘mS  1,11  finci  :'fd  as  ,i  ilici  u  \  <'t  die  human  cogmu  ve  aichitsa'iuic  in  New  cli  s  I  Oh 7  Willmsii 
JaiiKw  Ics'lincs  s.Ncwfl!,  !0'i(i)  ,Si>ai  has  siiue  been  .ifiplicd  !o  wide  larige  di  lonnitr.  c 
l.isks  ilnwis.  Cl  :il,.  |00();  Kosi‘iil)l(ii>m,  Fcniiiaii  A  laud.  I  "o  1 1  |  inuiC  iDinpicU' 

o\ci\!(,\\s  .it  .SiMi  .uid  SiKii  u'si'aich,  sec  (  4  ot  iNcwseil,  loOi.'i,  die  iri.ciil  .ScasO 

I ! i.'iiii.il  f  I  II  d,  (.'ongditii.  4  li inaiin  A  1  tooiciituis .  too  i  i  .md  the  i et  cm  cdiiCi!  col Iccnuii  o! 
pai  III  s  (  [\0' c  nl'locmi  1  .iii.i,  A  Ni'wcll,  1  oo  gi  | 


3.!.  Preliminaries;  architectures  and  Soar 


Fundamental  components 

The  basic  components  of  Soar  are  shown  in  Figure  3.1-^.  All  behavior,  from  routine  to 
difficult,  occurs  in  problem  spacer  'own  as  triangles  in  the  figure.  A  problem  space 
is  a  formulation  of  a  task  as  an  initial  state,  a  goal  state,  and  a  set  of  operators  that 
apply  to  states  and  produce  new'  states  (Newell  &  Simon,  1972;  Newell,  1980;  Newell, 
1990).  Any  application  of  operators  that  yields  the  goal  state  is  taken  as  a  solution  to 
the  problem.  A  working  memory  holds  the  momentary  problem  solving  context:  aspects 
of  the  problem  space,  state,  and  operator  are  represented  as  declarative  attribute-value 
structures.  Knowdedge  about  how  to  apply  and  .select  operators  and  problem  spaces  is 
held  in  a  long-term  recognition  memory^  w'hich  continually  matches  in  parallel  against 
the  declared  context.  The  recognition  memory  consists  of  a  large'*  set  of  condition-action 
associations  (productions).  The  conditions  .specify  patterns  that  occur  in  working  memory, 
and  the  actions  retrieve  knowledge  in  the  form  of  preferences  to  change  aspects  of  the 
problem  space  context.  All  long-term  knowledge,  whether  declarative  or  procedural,  is 
held  in  the  uniform  recognition  memory. 

A  step  in  the  problem  space  (e.g.,  an  operator  application,  or  an  initial  state  selection) 
is  taken  each  decision  cycle  (bottom  of  figure).  The  decision  cycle  consists  of  tw'o  phases. 
During  the  elaboration  phase,  recognition  memory  marches  against  working  memory,  and 
associations  fire  in  parallel  and  in  sequence  until  quiescence  is  reached,  that  is,  until  all 
the  relevant  as.-.ociations  have  finished  firing.  At  quiescence,  the  retrieved  preferences 
are  interpreted  by  the  decision  procedure,  wiiich  determines  the  next  step  in  the  problem 
solving.  The  dv;'cision  procedure  simply  implements  the  semantics  of  a  fixed  prefcience 
language,  which  allows  a  partial  ordering  to  be  detined  ovei'  problem  space  alternatives. 

If  tlie  retrieved  preferciices  uniquely  determine  the  next  step  to  take,  me  decision 
procedure  effects  that  step.  In  sucli  a  case  Soar  proceeds  by  reeognilioii.  But  this  need  not 
lx‘  the  ease;  kiiuwledge  mav  be  meonsisient,  in.eonelusivc,  nr  missing,  causing  an  imf>asse 
to  aiise  alter  (juir  seence  Soai  responds  to  impasses  by  .etung  up  a  new  problem  sjxirc 
in  vvtiieti  to  ileiihi'nilely  acquire  tlie  necessary  knowleii.’c  Impasses  inay  oecin  on  any 
|)n)hkmi  S[iace  luncliori  i  ui  example,  f  igure  .7  I  s1h>ws  iwo  impasses:  the  top  imp.isse  is 
tine  ic>  hick  of  know  letige  it)  select  ani-:mii  a  stU  ol  opnah'is.  anti  lower  one  is  tine  !o  Kick 
t>i  kiU'wledge  to  iqiply  an  operator  lm])as,ses  ma.y  ccciir  nitlelmilei)’,  leading  to  a  sid/gucd 
hier.irchv  in  vvorkiug  inenioiv  Any  imirasse  (not  jnst  the  last  one)  nia\  be  re.^ol\ed  at  anv 
lime,  ifsLiltme  iii  an  .mlom.ilic  collapse  o!  the  siib('(i;iis 

As  know  k  dge  liom  a  lower  protilerii  space  i\  ,u'cmnul:iletl  to  resnit  c  a  Inghei  iiiijiasse, 
So.n 's  Icarmiig  ini‘ch..anisni.  i  liurik;ng.  bii i ids  uev,  .issot nations  m  Uie  recngmlinii  memnrv 
til. it  will  lei.netc  the  kiii '.wiedge  m  lelet.iiii  toniexis  1)\  iiiatclc  1  has,  m  luune  sinnlai 

Ih,.'  f  I  ipl  ft  ''I  in  tli:  \  hatMi.T  nD.xOs  pftihlcin  sc);uf*  a.n!l  nict  !f  jiiiMn',.  ilicir  i- 

a  vu'W'iW)'  \ic\\  tli.ti  .1  /’D'/'/.rn  A/>.ii  r  t  .  - ‘fn//  /Nih/f  /  'P.Nt  .vli  <.  iifi  It'  iks.yihcti  i 'uitpe  tii  k  ni !  \  of 

p.ii  ■:  h  111.  i>:  w  nilb  >i  k-'o-I  liiipli'iiu-iii.if  h  Mr-.  <  NvA\  .-i!  t  {  .i!  .  !  wrM  i  \\u:  I  dSi  'M  is  rdscnl  ia!l .  .i  s[ i  In  Ail  it  to  ift.il 
ifiiKCs  sNiilht'i  irx'vl  ailU  kn-o”.  Inlr'r  k.'vrj  v  oniptMlrniA  It'’  thr  [MM  pt  t-.f  ■  -M  iiii-  ihrsi'.,  'fr‘  nioir 

!  !  -  P  i  i !  h  )|!,1 1  I  k  '  t  f  1  pl.li  >n  I  >i  Sv  Atl  \M  i  I  S'.!  fu  C. 

■*  1  [ir  I.M  1  V  s,  1,1!  sysiv-iD  V Milam  --  tM  H  ),( H  H  ‘  pi  o.Jtn  !  dama  -  1  .k‘0!  ri'l  a-  .  i  I  ,  ; p  i  )  >,  ,r  rr  i 'o  , ,  pc  ?  m  'p ai 
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situations,  the  impasse  and  the  d'diberation  it  leads  to  may  be  avoided.  Chunking  is  part 
of  the  architecture;  it  happens  continuously  and  automatically.  Although  there  is  only  one 
learning  mechanism  in  Sv.>ar,  chunking  works  over  every  kind  of  impasse  in  every  kind  of 
cognitive  activity,  giving  rise  to  many  kinds  of  learning  (Steier  et  al.,  1987;  Newell,  1999). 

Memory,  process,  and  control 

As  with  any  functionally  complete  architecture,  we  can  identify  tne  memory,  process 
primitives,  and  control  .structure  of  the  Soar  architecture.  Memory  in  Soar  consists  of  the 
large  recognition  memory  and  the  declarative  working  niemory-\  Both  are  unbounded 
memories,  but  with  very  different  properties.  The  knowledge  held  in  recognition  memory 
is  only  retrieved  if  the  appropriate  cues  are  present  in  working  memory.  The  associations 
cannot  be  modified  or  temoved  once  added,  nor  are  they  exaininabie  by  o'her  processes. 
The  burden  of  providing  free  compo.sability  rests  on  the  working  mt  /nory,  which  can  be 
rapidly  deployed  by  the  system  as  ii  ;  fit. 

The  processing  primitives  in  Soar  are  the  basic  operatiom-  available  to  alfect  niemor}' 
For  the  working  memory,  these  include  operations  to  add  and  delete  new  attribute- value 
structures.  For  the  recognition  memory,  the  operations  are  matching  and  chunking.  How¬ 
ever,  unltke  the  working  memory  primitives,  the  operations  on  recognition  memory  are 
not  modulated  by  knowledge;  that  is,  their  evocation  does  not  depend  on  ihe  contents  of 
memory.  (The  results  of  tlie  match  p>-oces.s  do,  of  course,  depend  on  the  contents  of  working 
memory.)  Teaming  and  match  are  continuous,  automatic  proce.sses.  No  retrieve  or  .stoic 
ofieiadons  ajipear  in  the  actions  of  associations. 

Soar's  control  structure  is  a  co  .binatioii  of  the  iccognition  match  and  Ihe  decision 
cycle.  Fach  processing  Sicn  defends  on  the  piclerences  leirieved  al  that  moment  by  the 
coruiition-actioii  pare;  in  recognition  memory,  d  he  hxed  decision  cycle  processes  these' 
preli'reiu:es  to  deicmimc  the  iie.xt  problem  space  step  ITe  co’itiul  is  liiereioie  opeir  since 
-ill  die  knou  lc'tige  ui  long  leriu  luemorv  is  bioughi  to  bt  !'  at  each  [loint,  .md  the  control 
niemorv  is  open  to  addition  ibv  cliiinkiiig)  1  Ins  recoyni:’  tleciile  m:f  siiucture  contrasts 
wiiti  the  /('.'</;  </c(  ('./(■  istcmc  cycle  ol  siandaid  conipiitfs  ■  s  bn'cciMi  es,  \\  liicli  icsti  icts  tfic 
active  coniiol  memoiv  to  a  liK:'al  piece  ol  piograni  hxed  !s.  P.i  ■  ci  :  m 


IVreeptimi  and  action 

No.i:  niter. lets  vviili  llie  outside  world  ihrougti  peiceptual  iiui  motor  modules  (tia!  make 
c  I'litaci  u  idle  Cl)  Ii  a  I  cognition  \  i,i  cvoi  king  iiiemor  v,  spec  i  ticaii  v,  tliioiieii  tlie  sl.ile  o:  die  lop 
pi ohleii I  s[),ic  c  ( i  is'iii'c  ’  !  )  Antononious  cncoi/ore  pioiliK'tioiis  j).\i  sv' pc'ict'pliia!  in[Hit  lo 

pn,’  n  n  c  it  t  ( *1  c  o  iiii  1 1 1C  1 1 .  sii  1 1 1  fill  1  \  ilraotht.'v  pi  otiiic't  ions  pr  c*p.u  c‘  ii  urt  >i  con  ui  ni  i  kl  s  ten  d  1 1  cc  t 
vc'ciii loii  (  not  sliov,  11  111  t tic  liinitc )  I  iico.linv  and  dc'codi ng  |t!  odiu  i mils  ,iu  .uilononio! is  in 

I  illlc  lie  ,!Oim!I\  illlrl  IMfllKH  !e',  111  Suit  ll'ii  ilivll'l;'  lIs.'  |MeleiriK<'  MKHHIO  Ii  '  tl'lUl  ifu  I  'iceved 
I'l e  ler.'iu.  !  Ii  ■  ts:  i  niei  pi .  n'll  In  '.tie  Je,  isn iri  e  \ ,  ■  e.  .iiici  .-  ineiih 'i '.  1. 1  lie'kl  )mi  !  i.ii  (n.iC  Ii  I ii !,  im  i.it n  >11  S  s  Ihe 
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Table  3. 1 :  The  time  scale  of  Soar  processes. 


Architectural  process  Cognitive  function  Time  scale 


Search  in  problem  space(s)  Unit  task 

Operator  implemented  in  subspace  Simple  composed  operation 
Decision  cycle  Elementary  deliberate  act 

Recognition  memoiy  match  Distal  knowledge  access 


~~10  s  -  minutes 
~~l  s 
~~100  ms 
-'  -''10  ms 


that  they  do  not  depend  on  the  current  problem  space  context,  and  they  fire  independently 
of  the  decision  cycle. 

The  perceptual/motor  side  of  Soar  is  presently  underdeveloped,  though  it  is  an  active 
area  of  research  (e.g.,  (Wiesmeyer,  i992)).  The  actual  implementation  lags  behind  die 
model  presented  in  (Newell,  1990);  in  particular,  the  encoding/decoding  scheme  has  not 
been  implemented.  For  tnc  purpo.ses  of  this  thesis  however,  the  details  of  the  perceptual 
motor  system  will  not  play  a  major  role. 

The  temporal  mapping  of  Soar 

Newell  11990)  provides  mi  analysis  that  grounds  Soar  tempt)raliv  as  ;i  tno.lel  of  human 
eogmiiini.  'Flie  resiills  are  summari/ed  in  'I'ahlc  3.1.  The  analy.sis  is  c  .msiraii'cd  f'Oin 
above  by  the  f  unctional  requirement  to  yield  cognitive  lieh.ivioi  in  about  a  second,  vir  a  few 
hundred  milliseconds  in  the  most  elementary  reaction  tasks.  The  analysis  is  eonstraiiied 
from  below  by  ba.sic  temporal  properties  of  neural  circuiliv  disla!  processing  (beyond 
local  circuits  of  about  a  cubic  niilhmeter)  cannot  liappeii  fasiei  llian  roughly  10  ms,  since 
(he  charaetensiic  processing  'unc  of  local  eirctiils  is  --  ^  1  ms.  1 1  '  elenieniary  lunetion  of 
distal  ae  ci'ss  ni  Soar  in  pro\  ided  recogiuiion  uiaich.  so  the  iccoginlion  nuilch  ninst 

lake  on  the  order  ot  10  ms.  !  heic  is  only  room  foi  two  more  system  levels  between  distal 
,1'.'  ess  aiul  eogii  li\'e  tinuMiou,  coi u  spi'iuime  to  tli  '  deeisioii  i  vcle  and  comp-isei,!  opi'ratoi 
111  Sou  1 'orhiiuaelv,  boili  the  bottom  an  iiid  lop  ilown  an.ilyseN  yield  ec-nsisieiu  icsulis 
for  Soar.  As  Newell  lei'caletll  c  emph.isi/cd,  these  \aiiu‘s  should  not  be  taken  as  pie'  ise 
\aliics.  Hit  i.illiei  oidei  ot  magnitude  esiiiuatcs  t  Newc  h  nsed  the  notation  lOOloinean 
9)  UM,)).  WiesmcNCi  I  I'-LLfi  ane'  .loluisoii  i  aird  (  19HK)  use  a  eonsianl  ol  SO  ms  [xn  operator 
III  in.ik-'  quanliiatiw:'  piedicaor.s  .icross  a  i.inge  oi  immediate  iCsponse  tasks 

(  nvei  age  ui  Soar  as  a  psyehologiea!  theory 

Soai  li.iN  bet  II  ,qi|)lied  (o  .1  '.^lde  '.iiigeol  i  <>gnili\e  pheiioincna,  itK  liiding  iiniiuaiiaie  leac 
iioi!  tasks,  ci.f  sie  puilil  '111  s.ihing  pu/zles  (e.g  ,  loweis  ot  Ikinoii.  veiiiat  le.isoiiiii!.’  le  g 
s',  llogisn  s  a  and  lepeiiio  >;  pi  ,k  t see  el leeis.  Newell  i  lOOui  is  sal!  ihe  lUosi  comi'i etu  ii 
■aw  letcu'iiee;  l.ewis  et  al  ilU-.Hti  anil  Koseablooin  ei  ai  il'TJdo  pnoidi.  nane  leccnl 
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3.1.3  Language  and  architecture 

How  does  language  comprehension  (and  proauction,  and  acquisition)  fit  into  an  overall 
theory  of  mental  architecture?  This  is  the  modern  computational  version  of  discovering  the 
relationship  between  language  and  thought. 

Most  psycholinguistic  work  does  address  architectural  issues  to  some  extent.  For 
example,  distinctions  are  drawn  between  between  modular  and  interactive  architectures,  or 
automatic  and  cognitive  processes.  But  there  have  been  relatively  few  explicit  proposals  of 
functionally  complete  architectures  for  language  processing^.  Exceptions  include  Marcus's 
(1980)  PARSIFAL,  the  CAPS  architecture  underlying  CC  READER  (Just  &  Carpenter,  1992), 
the  PDF  architecture  of  St.  John  &  McClelland  (1990),  and  the  annealing/activation-based 
architecture  of  Kempen  &  Vosse  (1989).  .All  of  these  models  make  explicit  assumptions 
about  the  control  structure,  processing  primitives,  and  computational  resources  supporting 
lingui.stic  behavior.  (The  Sausage  Machine  (Frazier  &  Fodor,  1978)  was  a  step  in  the 
architectural  direction,  but  was  never  specified  in  much  detail). 

Fewer  still  are  those  theories  that  relate  tne  architecture  of  language  processing  to  the 
architecture  of  cognition  generally.  This  is  largely  a  result  of  an  assumplion  in  most  theo 
rizingthat  linguistic  processing  is  modulai  (Fodor,  1983).  Thus,  when  explicit  architectural 
hypothe.ses  are  made,  there  is  often  no  attempt  to  generalize  them  beyond  language.  A  no¬ 
table  exception  is  the  work  of  Carpenter  and  Just  —their  t'AP.s  architecture  forms  the  basis 
of  a  general  cognitive  theory  that  has  been  applied  to  some  nonlinguistic  tasks  (Carpenter 
et  al  ,  )'>9()), 

Modularity  detines  the  first  choice  to  be  made  in  ileveloping  a  Soar  theory  of  language 
crMiiprehension:  should  a  speend  linguistic  input  system  be  [losited  outside  ot  the  existing 
architecture'^  Or  sliouid  the  Ciunpreheiision  cafiabihty  be  (.icveloped  within  tlie  given 
mechanisms'^  If  \vc  take  wliat  a[)()ears  to  be  tlie  iiie.rlular  route,  we  esseiitiallv  slarl  fie.sli 
with  respect  to  detiiiing  tlie  control  structure,  processes,  and  memorR  s  tor  compreheiisum 
file  mieiesimg  issiu.'  itu.'n  becomes  the  iiatuie ol  the  inter! ace  ecu  Soar  aiul  the  llIli’UlsIlc 
module 


!  he  INI  Soar  ap[)ro;u  h 

Die  alleriiatne  a['|iroa(.'h.  and  tin-  one  ue  have  avioplcit  wilh  N1  Soai  (Ichman,  l.c-wn. 
A'  .\euell,  l‘toi,!  lOOib,  I')')-;,  l.etimaii,  Ne'vsell,  Folk  A  l  ewis,  P>0v  i  ew as,  lOOPi, 
I  on  ’,b  1  e w  Is,  Ns' w  ell,  A  Folk.  FkSO  Ne w cd i ,  i  ‘iS  i  00( ) \  |s  to  ea iii nai  the  i  oin| a cliv'iision 
capatnhlN  within  I  he  e  \  isiing  .lu  hileiture  Ibis  is  itu'  [lath  uiy’ed  iw  '^.'tw'.ell  in  Fie  Wiliiani 
Finn's  U'v'iini,"v,  vslieie  tie  liisi  ■keuhi'il  the  So.n  iheois  oi  » <  n  i  ipi  clu  iisn  m  h  i,  ^  .';Mliall\ 
a  I  iiiiiimalisi  .ipj'i  o.icti,  .lilempling  lo  see  iu'w  i.n  ilie  piestail  incv  tiani  .nis  c.iii  be  pushed 

'll:  1.1',,  I  (he  M.llMl  Ml  sf  cniiAl  NClh  ‘Us  t  ll.  Hi  '  li  In  I  Ol  xli'l  (  i“'  1  I  h.lt  If.‘  I  Xsllicl  i  ':C  MC  ll  ;  .tit  ( -  M  i  IF  'i  F 
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before  positing  new  ones.  Newell  clearly  viewed  the  success  or  failure  of  the  venture  as  an 
open  question,  with  no  a  priori  resolution.  It  could  in  fact  turn  out  that  Soar  is  not  up  to  the 
phenomena  of  real-time  language  comprehension. 

Such  an  approach  may  seem  to  be  completely  at  odds  with  the  modularity  hypothesis, 
and,  more  to  the  point,  with  the  evidence  accumulated  in  favor  of  it.  Real-time  processing, 
automaticity,  etc.,  may  seem  to  be  thrown  out  in  favor  of  a  view  of  language  as  general 
p-mblem  solving.  Prejudging  the  approach  in  this  way  is  misguided,  however.  Even  in 
a  modular  architecture,  there  must  exist  some  relationship  between  linguistic  and  nonlin- 
guistic  processing,  and  there  may  even  be  architectural  principles  in  common  across  the 
modules.  The  apparently  non-modular  re,«earch  path  we  have  taken  is  one  way  of  discov¬ 
ering  these  commonalities.  More  importantly,  the  remainder  of  th  .,  thesis  should  make 
clear  that  the  basic  phenomena  of  real-time  comprehension  are  dealt  with  in  considerable 
detail,  and  the  approach  has  led  to  an  even  richer  understanding  of  modularity  than  might 
otherwi.se  have  been  possible  (Chapters**  i  ad  9). 

3.2  The  ba5-I  i‘  ,structure  of  NL-Soar 

Building  a  i,  ('mprehension  model  in  Soar  requires  specifying  the  probleivi  spaces  and  oper¬ 
ators  that  achieve  (he  (mH  tions  of  compichension.  This  section  lays  out  the  Itasic  structure 
ol  N1  -Soar  m  these  ici.ns  The  first  ordei  of  business,  therefoie,  is  not  desciibmg  Nl.-Soar 
along  tiaditional  diniensn«:,.> '  '  parsing,  such  as  top-riown  or  bottom  up  or  lelt  comer,  but 
rathe!  speed  sing  hosv  >  >.r  ipici  ension  is  reali/ed  in  .Soar's  architcctur.il  mechanisms.  Of 
course,  the  traditiomil  eh..r,  len/ations  .irt  both  possible  ami  useful  but  they  are  only  part 
o'  the  story. 


3.2.1  C'onipirheusioii  ttporalors  and  (ho  real  (inie  ennsi  raint 

Soai  eompi eheiids  l.inge.igi'  b\  a[ipl\  mg  i  oinjorlicnMon  < y 'c.'r  !<>> s  to  die  meui mng  Imgu is 
lie  mpic  I  liese  oiiei.ilois  pioduce  sMil,ti.  lu.'.  scm.iiitu  ,  .uui  .eieu  nlial  ie|iie.senl.ilions  in 
ssorkiiii’  memeus  (>J.I  ).  l  iaa. iiomiiis,  i. ompieheissioii  opei.iiois  aee.mijilish  the  m.ip|>mg 

I  I  S  !<■  r  ■  /  '  .V  .A' 

\s  hei  e  /  ei  i  K's ponds  pi  p.  .ssihle  incoming  I mcuisl ic  mpiil ,  /  .V,  ,iiid  ‘V  coi  i csisond  lo  j'os 

Slide  sNiii.u'tic.  si'm.inlii  .  .iiul  icl  "ctir.ii  o  liu-sciii.'iioiiv,  lesr'Ci  t  i \  rd \  .  .md  /’ coi  responds 
ici  itk  cmiciil  piohlciii  stdsmg  .i.>ulc\i  it  iclt.s  i('  iht  'Uiciaikc  o  [';(.'sciil,ilii'i!.  lo  'n' 
r'  \ [d.uiK'd  m  .1  mull u.-ilt  1 

I  mmcdrics  i  cqui  i  cs  dril  I  he  r  ompi  ehcnsio!!  o|>ci  .il.u  s  r  !  feel  this  m.ippmg  iru  i  eimT*!  al  K 
1 1  \  (.■ !  ,1  a  1 1.1 1 1  i  im  M .  c  of  imgii!\f  k  mpul  i  .it  lea  -  i  .a  die  ss  rad  le  \  r  1 1  i  1 1  \  rii  1  h  ■  iL'dipoi  ,il 
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Figure  3.2:  Recognilional  comprehension.  A  stream  of  comprehension  operators  applies  to  the 
incoming  linguistic  input.  There  are  three  types:  syntactic  ope.rators  (U),  semantic  interpretation 
operators  (S),  and  reference  resolution  operators  (R). 

four  or  five  operators.  This  is  the  first  and  most  serious  constraint  on  the  NL-Soar  model. 
It  means  that  comprehension  must  proceed  mostly  rccognitionally,  in  other  words,  without 
incurring  impasses  and  engaging  in  deliberate  problem  solving.  The  knowledge  to  propose, 
select,  and  implement  the  comprehension  operators  must  be  immediately  available  via  the 
recognition  memory  match. 


3.2.2  The  structure  of  comprehension  operators 

Even  with  the  tight  constraint  provided  by  real-time  immediacy,  there  are  a  number  of 
alternatives  for  structuring  comprehension  operators,  corresponding  to  different  ways  of 
distributing  knowledge  across  the  v  arious  problem  space  functions  (operator  proposal, 
operator  selection,  and  operator  application).  We  will  consider  this  space  later  in  §3.7,  but 
for  now  simply  posit  the  following  three  kinds  of  comprehension  operators: 

•  U -constructors  build  the  utterance  model,  which  represents  the  syntactic  structure  of 
the  utterance. 

•  S-constructors  build  the:  situation  model  which  represents  the  meaning  of  the  uTer- 


•  Resolve  operators  perform  rclcrcncc  resolution  by  recognizing  parts  of  the  situation 
rncdcl  as  descriptions  of  previously  mentioned  or  known  entities,  and  elaborating  the 
s'.lualioii  model  with  tha!  infonnalion. 

( -d  vs'ri  these  ivpes,  lagnrc  v2  :;liows  an  ex,un[.)!e  ol  what  leroi'nitional  compiehension  looks 
iikc  .is  a  sfoiaoi  e!  eoinpiciieiision  opoiaiors  Nonce  liial  every  word  lU'ed  noi  evoke  all 
tlirci-  o[iciaiO!'  ivpes,  ihai  ioote  ihan  one  :)pefa!or  ol  a  given  iype  may  agiplv  [)Ci  vviiuf  and 
tlieie  i--,  no  lived  (luieruig  of  ap(.>lieati(.n.  All  lhat  tins  ;>(  lu.’Uie  suines  1n  linn  the  set 
ot  opeiaUMi.  Is  Miifieien!  hn  nu  lenienialiv  eonMiueting  he  (a>nipie;;  ssion  ilata  sirnetures, 
wi'ti  some  dr  imoii  oi  lahi>!  aniong  the  ,iitfcreni  tvfx::-..  ‘■iirlhcrnKa e,  n  ^il,^,i  he  tiie  ease 
ifia!  ihe^f  vitieralots  luiist  iarisih'  bung  !<>  hear  UMilspie  k nowledg.'  soiiui's,  p  i!ie\  <iie  t.i 
ai  Cii'iiPiish  ifie  sminned  mapping  ;n  .1  jiist  a  hov  upet.itoi,-, 


78  Chapter  3.  The  NL-Soar  Theory 


3.2.3  F rom  deJiberatioii  to  recognition:  comprehension  as  a  skill 

'lb  reiterate,  achieving  the  purely  recognitional  comprehension  illustrated  in  Figure  3.2 
requires  that  the  proposal,  selection,  and  application  of  the  comprehension  operato;s  be 
accomplished  directly  by  associations  in  recognition  memory.  Where  do  these  associations 
come  from?  The  architectural  answer  provided  by  Soar  is  that  they  must  arise  e.vperientially 
by  chunking  (or  else  they  are  innate). 

.In  fact,  the  N.L-Soar  model  does  not  specify  the  associations  that  directly  perform 
comprehension.  It  specifies  a  hierarchy  of  problem  spaces  that  give  rise  to  the.'-'e  as,sociations 
via  chunking  Figure  3.3  shows  the  basic  structure  (the  details  vvil!  be  provided  in  the 
remainder  of  the  chapter).  When  an  impas.se  occurs  due  to  lack,  of  immediate  knowledge 
to  accomplish  some  comprehension  operator  function  (proposal,  selection,  or  application), 
NL-Soar  enters  these  lower  spaces  where  independent  knowledge  sources  may  be  brought  to 
bear  in  a  search  for  the  correct  utterance  or  situation  model.  When  the  i.mpasse  is  resolved, 
chunking  automatically  builds  a  new  association  in  recognition  memory  that  should  allow 
comprehension  to  proceed  smoothly  in  future  similar  situations.  These  associations  may 
be  quite  general,  or  they  may  be  quite  specific,  depending  on  the  nature  of  the  impasse  and 
the  problem  solving.  We  call  these  associations  chunks,  though  all  associations  have  the 
same  form,  whether  posited  by  the  theorist  or  created  by  chunking. 

One  important  characterization  of  comprehension  that  emerges  from  this  model  is  that 
comprehension  is  a  mix  of  recognition  and  deliberation.  Given  the  .severe  time  constraints, 
comprehension  must  be  mostly  recognition — an  automatic,  rapid  process.  But  there  is 
always  the  capability  to  fall  back  on  the  deliberate  spaces  when  recognition  fails.  Just 
how  much  of  adult  coinprehen.sion  consists  of  recognition  vs.  deliberation  is  an  interesting 
theoretical  and  empirical  issue  that  will  be  addressed  in  Chapter  7. 


Another  important  chararterizatioii  that  emerges  from  this  model  is  comprehensinn 
as  a  continuously  improving  skill.  To  be  clear,  NL  .Soar  does  no!  specify  a  theory  of 

language  acquisition . '.he  language-specific  knowledge  in  the  lower  space  is  posited  by 

the  theorist.  However,  it  does  specify  ll-at  certain  aspects  of  eornprchensioii  wiii  always  be 
open  to  irnpiovenient.  As  we  will  see,  the  ability  to  handle  c’lfficull  syntactic  constructions, 
ambigiKais  material,  and  contextually  specific  interpretations  may  all  be  modulated  by 
el'iinking.  I  .anguage  learning  does  not  stop  with  the  acquisition  r>l  syntax  or  vocabulary. 


3.3  IMie  ytlen»ifc  nsodeS 


i  iiis  seeOiiM  (it'svi il'C;.  (lie  siiui  tinc  >>)  ihe  iilleiaiua'  iiitKlei  ,inil  tnc  pi oi  case's  tor  liuiiding 
!i .  1  lie  desc!  ([)' ion  ;n  raiu'K  in  sx  iitar  no  ler  ne.,  iiiui.  [soiiicni  sd  m  man  tics  and  conies  i ;  dse 
iiitcKii.  (Ion  ol  -.vacas  ss  iili  oifici  kmie  k  dgc  sources  svil!  t'e  cxplorasi  in  §3 Us 
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Recognitional 


Figure  3.3:  Comprehension  as  deliberation  and  recogn'tion.  The  operators  in  the  top  space 
achieve  comprehension  by  recognition  when  they  are  propose<l,  selected,  and  applied  directly  by 
immediate  memory  retrieval.  If  the  required  associations  are  not  present  in  long  term  memory, 
impasses  arise  and  the  relevant  functions  are  carried  ou*  deliberately  in  lower  spaces.  As  impasses 
are  resolved,  /lew  chunks  .rre  fonne-J  that  peifc'irn  the  function  by  recoguitieti  in  future  similar 
situations. 


3.3 J  'What  the  utterance  model  repre^ctits 

The  utterance  model  represents  X  bar  phtasc  stnietuie  as  assumed  in  (F  vernrnent  and 
Binding  thetary  (e.g.,  tC'homsky,  F'SG;  Cowpei.  1992)),  Because  many  irl  the  predicfons 
described  later  in  the  diesis  are  sensitive  to  syntactic  strueiuie,  selecting  an  existing  syntactic 
them  Y  helps  guard  against  ,id  here  analy.sts  Uiat  will  lad  to  hold  across  a  wider  range  of  cross 
linguislie  slciu  t,.res.  The  riartienlar  clunee  o!  (iP.  str.ieltnes  was  uiaik'  on  both  pmgniatic 
.tn  1  iheoretieal  gtonnd.s.  Ih'aginatieally,  usntg  CiH  allows  foi  a  inon  direct  couip.u  ison  v,  ilh 
ihe  ircern  detailed  inod.cls  eif  Prnelict! '  1 9‘>2 )  anti  <  iidson  (  i ')'■>  1  ),  is  v.ell  ,is  .i;i  lucoi poi anon 
od  iisetii!  .ispei  ts  ut  those  nunfels  and  analyses.  ’!  beoietiriailv,  the  esjalu  it  ornii  iples  and 
pill  anietc!  s  apj:!!  oac!!  ins  n.nar.jtly  niio  the  const  mini  !  siseii  geiua  ate  and  iCst  it  a;  tie", a  a  k  ol 
Nl  So,i[  olesenlici.l  in  die  next  seeiion) 

'['be  b.ish.  Iku  sehenia  is  shosMi  ni  higurr.  3  4,  X  mnges  tAC!  the  sinlaeti  '  t  aicgvines 
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X’YXP) 


X  Z”(ZP) 


Figure  3.4:  X-bar  schema.  YP  is  in  specifier  position  (spec-XP).  ZP  is  in  complement  position 
(comp-X’). 

A  (adjective),  C  (complementizer),  I  (inflection),  N  (noun),  P  (preposition),  and  V  (verb). 
There  are  two  levels  of  phrasal  nodes  projected  from  lexical  heads:  X’,  and  .X”.  X” 
is  assumed  to  be  the  maximal  projection  and  will  usually  be  denoted  XP.  (Inflectional 
phrase  (IP)  corresponds  to  S  in  more  traditional  phrase  structure  grammars;  complementizer 
phrase  (CP)  corresponds  to  S’.)  The  set  of  available  .syntactic  relations  between  nodes  is 
{spec,  comp,  cornp2,  head,  adjoin,  adjoin-head},  which  denote  the  structural  positions 
of  specifiers,  complements,  heads,  and  adjunction.  Adjunction  will  be  explained  in  more 
detail  below.  Syntactic  structure  is  thus  a  strict  tree  with  a  typical  branching  factor  of  one 
or  two.  Figure  3.5  gives  the  X-bar  structure  for  a  complex  noun  phrase  ,  with  the  structural 
relations  explicitly  labeled.  In  future  tree  diagrams  the  relational  labels  will  usually  be 
omitted.  Some  intermediate  nodes  may  also  be  dropped  for  brevity. 

3.3.2  How  the  utterance  model  represents  (or,  why  call  it  a  model']!) 

We  now  make  a  general  assumption  about  how  mental  models  (§2.1.1)  fit  into  Soar: 

(69)  Models  assumption:  States  iii  problem  spaces  are  annotated  mental  models, 
which  are  pure  models  with  a  limited  set  of  annotations  or  tags  that  expand 
the  re[)rescntational  .scope  or  help  control  processing  (Newell,  1990), 

Ibis  icprcscniatK ’iial  assumption  is  adopted  in  most  cognitive  modeling  work  in  Soar.  It 
grew  out  ol  Polk  and  Newell 's  (  1 988)  work  in  modeling  syllogistic  reasoning,  w'hich  sought 
to  explicate  the  role  of  menial  rnotlcis  (ala  Johiison-Faird)  in  Soar.  As  slated,  it  takes  the 
lo'in  of  an  aichitectiiral  assumption,  since  it  cuts  across  all  tasks  and  domains  Hov/ever, 
as  Newell  >  i9‘:())  pomts  out.  llie  attribute  value  scheme  in  .Soar  is  neutral  with  resfiect  to 
the  use  of  modefs  oi  eon  tnodci  lepresentalions  Thus,  there  ts  still  an  imcior  taut  issue  as  to 
vvliat  m  .Soar  should  give  use  !*>  this  rcsiriction.  Without  making  any  commil ments  to  the 
genesis  of  iiiodcis.  we  simpls  adofU  itiO)  .is  a  umfonii  repiawcnialional  law.  The  [liiiiiaiw 
tunciioual  arhaailage  oi  riioilels  is  coiigiui.uional  ellicieucy  the  knowledge  raieoded  m 
model  ■■  cati  be  l  yiiaeted  vs  ith  match  like  proi.'essmg. 

'he  iminediaie  i.  (insequenee  ol  adopting  ihi.s  assumption  wit!;  K.-spts.  t  to  ssiiticUi 
siuietiin.'  Ill  N1  Soar  is  that  ifie  liapresentaiion  o>f  sviii;!X  mu'-t  t>e  ,i  nieniei  We  t  all  ihis 
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liGUKP  3.5;  X-bar  phrase  structure  for  the  coniplex  NP  the  thought  that  John  u  t/.v  hitting  the  ball. 


representation  an  uttcratu  e  model  because  it  is  a  model  of  the  structLire  of  the  utterance. 
1  he  Litleiaiice  model  must  satisfy  the  .structure  correspondence  principle  (§2.1.1),  which 
states  that  aspects  (d  ;t  model  correspond  directly  to  aspects  of  the  represented  domain. 

■f  he  reali/iition  ol  the  utterance  model  as  an  attribute  value  structure  is  straightforward: 
attributes  correspond  to  the  structural  X-bar  relations,  or  syntactic  features  such  as  category 
or  agreement  I'lie  values  ol  the  attributes  correspond  to  other  objects  in  the  model  (i  c., 
nodes  in  the  tree),  or  constants  ie[uesenting  the  values  of  the  syntactic  features  Figure  r  b 
ilhistiates  a  simple  e,\am[)lc. 

riie  mode!  icsiiiclion  mav  .seem  so  weak  as  to  [uovide  little  conslramt  in  tlevelofung 
rcpaesentiilions.  Hut  in  tad,  some  lamihai  representations  loi  syntax  arc  luled  out  iuc 
('anse  llic\  'violaii.'  stsiK  lure  du'rcsjiondeiice.  I’tire  logic  re|>ie\eiitation'.  aie  noi  niodels, 
as  discussed  eaibcu.  the  ch.art  data  striu'tures  tluil  urideihc  the  most  etticicnt  lo.^itext  tree 
p;iiscis  '!..;i[lev,  F'/t),  Winogiad,  IHKdlalso  violate  sUiictiir-.  coricsiiond.cnce.  ('harts  arc 
space  and  linic  ctticieiii  ticcaiise  itiev  s\ stematical !>  exfiloi!  tcdundain.  ics  .uaoss  multpilc 
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(p27  "'bar-level  max 
"head  p23) 

(p23  'bar-level  one 
"head  pl9 
“comp  nl7) 

(pl9  “bar-level  zero 
"category  P 
“lex  for) 

Figure  3.6;  Attribote-value  structures  for  the  utterance  model.  Each  node  in  the  utterance  model  is 
represented  by  a  unique  identifier  in  working  memory,  with  a  set  of  feature  augm.entations  (attributes 
and  values). 

Structural  interpretations.  As  a  result,  the  correspondence  between  elements  in  the  chart  and 
elements  in  the  represented  domain  becomes  one-to-many.  The  efficiency  of  constructing 
and  storing  the  chart  is  traded  off  against  the  potentially  increased  computation  required 
to  extract  information  from  the  chart;  for  example,  determining  whether  a  chart  repre¬ 
sents  a  particular  phrase  structure  tree  involves  a  combinatoric  search  through  the  possible 
configurations  implicit  in  the  chart. 

3.3.3  Constructing  the  utterance  mode! 

This  .section  de.scribes  the  processes  that  incrementally  build  the  utterance  model.  First 
we  consider  how  the  utterance  model  is  organized  in  working  memory,  then  trace  the 
construction  of  the  model  from  lexical  access  to  establishing  syntactic  relations. 

The  utterance  model  in  working  memory 

As  illustrated  in  Figure  3.6,  the  utterance  model  is  an  attribute  value  structure.  The  structure 
must  bo  anchored  in  some  t  ashioii  to  the  problem  space  stale,  which  is  a  distinguished  object 
111  the  goal  context  in  Soar’s  working  memory  (ihis  is  an  architectural  retjiurement).  All 
objects  in  a  slate  are  attached  to  the  state  ideiitilier  as  values  of  attributes,  b’or  the  utterance 
model,  one  possibility  is  to  Miiiplv  liave  a  single  alinbuie  which  points  to  tiie  root  of  the 
lihiase  siiucture  iree: 

root  i;39) 

(1.1.39  bd!  ItM/cJ  riiax  hc.i'i  .  .  .  ) 

(  )i  k  iuiiM',  nuiltii'iic  \  allies  will  he  uxpnied  when  the  n!!t,-i,niei.  model  t  on.Msis  oj  nnilhjile 
eonslitueiits  im!  ^e^  gioiiped  mtu  higher  siriKliiies. 
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(state9  ''root  u39  u40) 

(u39  "bar-level  max  ...) 

(u40  "bar-level  zero  .  .) 

The  attributes  at  the  state  level  define  the  initial  access  path  to  the  utterance  model.  With 
a  root  attribute,  the  root  node  is  directly  accej>ible;  any  other  node  must  be  reached  via 
the  root.  This  requires  explicitly  encoding  the  specific  access  paths  into  the  conditions  of 
associations  in  recognition  memory,  and/or  providing  a  general  problem  space  for  navigating 
the  structures. 

Another  possibility  is  to  provide  uniform  access  to  all  nodes  in  the  tree; 

(state9  ‘'all-nodes  u39  u40  u42  u57  .,.) 

These  two  possibilities  define  two  extreme  points  of  a  range  of  possible  indexing  schemes. 
There  is  a  basic  computational  tradeoff  between  the  cost  of  the  recognition  memory  match, 
and  the  cost  of  the  deliberate  problem  solving.  The  root-access  .scheme  requires  encoding 
specific  access  paths  in  chunks,  which  means  there  is  increa.sed  potential  for  deliberate 
problem  solving  when  the  set  of  existing  chunks  is  insufficient  for  accessing  some  novel 
structure.  The  uniform  access  scheme  avoids  the  necessity  of  encoding  specific  access 
paths.  However,  the  uniform  access  path  involves  a  large  multiply-valued  attribute  (or 
multi -attribute)  which  gives  rise  to  unwanted  combinatorics  in  the  match.  The  potential 
combinatorics  caused  by  these  undiscriminated  sets  can  easily  be  seen  in  the  following 
c  ondition  for  an  as.sociation.  Angle  brackets  denote  variables: 

(<state>  ■'all-nodes  <x>  <y>) 

Giver;  n  values  on  the  all-nodes  attribute,  this  particular  condifion  will  lead  to  potential 
instantiations  of  the  association.  In  general,  with  C  conditions  and  W  working  memory 
elements,  the  match  complexity  is  V\T  .  An  association  with  .such  a  condition  is  called 
an  expensive  chunk,  and  the  exponential  cost  of  matching  these  associations  has  been 
demonstrated  repeatedly  both  m  formal  analysis  and  in  implemented  Soar  systems  (Tambe, 
Newell  &  Rosenbloom,  1990).  The  resulting  slowdown  compromi:ses  the  constant  time 
assumption  of  the  match  (Table  3.1 ). 

To  avoid  the  pitfalls  of  both  extremes,  NL-So-ar  uses  an  accessing  structure  that  provides 
sufficient  discrimination  to  avoid  the  expensive  chunk  problem,  yet  is  lutictionally  adapted 
to  tlie  parsing  process  in  such  a  way  that  the  i>  'ev  iiit  nodes  in  tlie  utterance  model  are 
direct iy  accessible. 

The  ulea  is  to  index  nodes  try  the  irotential  s.Mi.i-  iic  iclaisous  lliat  they  may  enter 
into.  I  lie  structure  rs  e.alled  the  A/H  sei,  tor  assignets  and  itvanvers.  [  igiiie  .1/  shows  the 
coni'-nis  of  the  A/R  set  during  parsing  ./o/'m  hii  she  hall,  pist  after  tire  NP  |  ^/.  the  hall]  has 
I'eeii  firrmed,  and  just  befoie  attaching  \si-  the  hall\  m  tlie  conijiletiieni  [hisUiou  of  |i  '  hi!\. 
file  NP  Iv/'  tli.c  hall]  is  a  potential  leceiver  ol  the  relaiiori  nanplenient  al  (oh|ect  u!  a 
wd  ),  as  well  as  speciliei  at  IJ‘  isufr|ev:t  (rosition),  !  he  N'  nodt  j)io)ci  ied  trom  hit  is  ,i 
pots'i.i  la!  assignei  of  ilie  r  iiiriplement  of  V’  relation. 
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Figure  3,7;  The  A/R  set  during  John  hit  the  ball.  [Af<  hall]  can  assign  ati  adjoin-N’  (nriodifier) 
relation,  the  ball]  can  receive  the  comp- V’  relation  (be  a  complement  of  a  verb),  and  so  on. 

Proposing  potential  links  in  the  utterance  model  is  a  matter  of  pairing  up  assigners  and 
receivers  indexed  by  the  same  relation.  For  example,  the  following  condition  binds  to  a 
verb  and  a  potential  object: 

(<state>  ''as.signers-coinp-V'  <vl> 

''receivers-comp-V'  <xp>) 

Once  a  link  is  made,  the  receiving  node  is  removed  from  the  receivers  set  for  all  relations, 
since  a  node  can  only  have  one  parent  in  the  tree.  It  also  seems  reasonable  to  remove  the 
assigners  node  from  the  assigners  set  for  the  particular  relation  established.  However,  this 
can  lead  to  disaster.  The  A/R  set  provides  general  access  to  the  utterance  model  not  only  for 
parsing,  but  also  for  Interpretation.  If  nodes  were  removed  immediately  from  the  assigners 
set  when  links  were  established,  the  node  could  potentially  disappear  entirely  .from  working 
memory  before  any  internretation  could  take  place.  Thus,  the  nodes  remain  in  the  assigners 
set  even  after  links  are  established  (Figure  3.7  shows  just  a  subset  of  the  complete  A/R 
set).  The  assigners  set,  then,  provides  an  efticienl  access  mechanism  for  any  process  that 
works  on  the  partially  completed  syntactic  structures.  For  example,  the  following  condition 
instantiates  fora  verb  and  its  object; 

(, <frtat(r!>  'assigners-comp-V’  <vjl>) 

(<vl>  A'.c5mp  <oh].s) 
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Figure  3.8:  Results  of  lexical  access.  Each  potential  bar-level  zero  node  is  retrieved  and  placed 
in  the  A/'R  set. 


for  the  particular  value  two  is  discussed  in  Chapter  5.  For  now  it  is  enough  to  realize 
that  the  restriction  helps  to  avoid  the  unbounded  growth  of  syntactic  structures  in  working 
memory,  and  that  this  method  follows  from  the  empirical  aiid  theoretical  investigation  of 
the  recognition  match  in  Soar. 


LcKical  access 

Lexical  access  provides  the  raw  material  for  constructing  the  utterance  model.  More 
specifically,  lexical  access  retrieves  a  set  of  nodes  corresponding  to  all  the  bar-level  zero 
positions  that  the  lexical  item  may  occupy.  These  nodes  are  deposited  into  the  appropriate 
locations  in  the  A/R  set.  Figure  3.8  shows  the  results  of  the  lexical  access  of  square.  Two 
nodes  are  retrieved  corresponding  to  the  adjectival  and  nominal  senses  of  the  word.  The 
nodes  are  indexed  in  the  receivers  .set  under  A'  -head  and  N '  -  head.  In  other  words,  square 
can  serve  as  the  head  of  an  adjective  phra.se  or  a  noun  phrase. 

Lexical  access  is  accomplished  by  associations  in  recognition  memory,  like  any  other 
process  in  Soar.  The  access  of  the  multiple  entries  happens  in  parallel  and  independently  of 
context.  The  parallelism  is  due  strictly  to  the  inherent  parallelism  of  the  recognition  match 
The  context  independence  is  an  assertion  about  the  conditions  of  the  access  associations. 
There  is  a  fair  amount  of  empirical  evidence  that  suggests  lexif'al  acce  >  !■<  independent  ol 
biasing  contexts  (e.g..  (Swinney,  l‘)70)).  liuwever.  (here  are  also  some  tunctional  reasons 
loi  assuming  conlex'.  inde['iendenee,  discussed  in  §3  J. 

NL  Soai  docs  not  [irovidc  a  detailed  mode!  ot  lexical  access  ll  simply  cireumscnbes  the 
required  1  mu.  tioiiality  by  [lositmg  associations  thai  map  words  to  their  entries  as  described 
above.  Ciiven  these  mapping  associations,  there  arc  ai  least  two  distinct  possibilities  lor 
u'aii/ing  lexical  access:  bv  ojicrator  application  ui  by  eiitoding  .rssin  laiiuus  It  ilic  across 
liapjicns  via  optaatur  .ij  pbc.iiion.  then  the  assor  i.itioiis  will  be  eoiklitional  upon  some 
(iiiiliculai  opi'iatoi  i|X’'hajis  a  sperial  lexuai  arecss  ojKa.iIoil.  ll  (he  access  iiajipciis  \aa 
ciiciuliiig,  ihcn  tlic  .issoci.ilions  can  tire  niticpcndcnils  ot  the  r-UMCnl  go.d  context  aiu!  toe  iis 
ol  .lUenlioi)  !  Iieic  is  sonie  le.ison  lo  beliexe  that  lltr  latlei  mav  bi.-  eoiiecl,  die  i.ackiiei  A 
(  i.ii  iell  1  .10  /’.’,)  cxpei  iiU'.  iits  siiv.cesi  that  lexica!  ai .  css  .  ..m  uccm  w  iilioiii  ailculion.  Suit  e 
ecrsvlmg  j)i  Ol  ilk  ii  Oils  are  nol  \  cl  iriipicrncnted  in  .Sc-ai,  N I  Sicii  lies  i  lie  .'ccc;>s  iv>  an  /qici  aioi 
ap[il it sil 'Oil  this  choice  is  uoi  ciiSicai  lo  most  <.'i  t(;e  fiuafii,  lioi's  niadc  ui  die  su!''-,et,jiii'ni 
chapiers.  \\  licit  dicu'  is  aiss  filecl,  it  Wtl|  he  c\|>lK:lh  luitcd 
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He;»d-driiven,  coiistraiat-based  constmetson 

Once  lexical  access  is  accomplished,  the  construction  of  the  utterance  model  proceeds 
by  establishing  structural  relations  between  nodes.  Figure  3.9  shows  the  processing  after 
encountering  Florida  in  John  likes  Florida.  The  relations  are  created  by  link  operators. 
These  link  operators  exist  in  one  of  the  lower  problem  spaces  that  implement  the  u- 
constructors.  As  Figure  3.9  illustrates,  some  of  the  link  operators  are  used  to  create  the 
higher  proiections  of  the  level  zero  nodes  retrieved  from  lexical  access. 

Each  link  may  be  subject  to  a  number  of  syntactic  constraints.  These  constraints  are 
independently  represented  by  separate  operators  in  a  problem  space  that  checks  the  well- 
formedness  of  proposed  links.  For  example.  Figure  3,10  shows  the  constraints  evoked 
for  the  link  operator  that  places  [,v/>  John]  in  specifier  (subject)  position.  The  agreement 
constraint  ensures  that  the  subject  and  verb  agree  in  number.  The  order  constraint  ensures 
that  the  subject  precedes  the  verb.  These  constraints  are  generated  for  any  X-bar  specifier, 
including  determiners  for  noun  phrases.  NL-Soar  cutTcntly  implements  a  number  of  basic- 
constraints,  including  number/person  agreement,  order,  subcalegorization,  and  simple  case 
checks. 

This  parsing  organization  has  two  distinguishing  features: 

•  Parsing  is  a  bottom-up.  head-driven  process  that  begins  with  projecting  phrasal  nodes 
from  their  incoming  lexical  heads  (Pritchett.  1991).  Nodes  are  only  created  when 
there  is  lexical  evidence  for  them.  In  other  words,  there  are  no  explicit  expectation 
data  .structures. 

•  There  are  no  explicit  phra.se  st.mcture  grammar  rules  in  the  system.  In.slcad,  the 
utterance  model  emerges  from  the  interaction  of  the  X-har  structures  projected  from 
lexical  heads,  and  the  independeiuly  repre.scnfed  constraints. 

Nl.  Soar's  parsing  mechanisms  thus  naturally  reflect  t.lie  basic  stnu  ture  of  grammar 
.issuiucd  HI  the  principles  atul  parameters  approach  to  syntax  WliUe  the  eoirslramls  imple 
menied  in  the  present  modei  do  no!  necc:%s.u  liv  map  directlv  onto  the  jrnn'-ipie.s  i>’  pKvIiiies 
!ii  tiH.  .i  closer  and  niorc  tiioruugh  niappuig  should  he  possible  In  anv  er  eut.  tlie  sink- 
iiire  ahe.ub  paeseiit  in  Nl  .Seau  is  lons.istcnt  with  a  iexicalh  generated,  const! amt  basei.1 
approach  to  gisurimar 

IfcskUs  ih.c  simpb'  ii\'  o!  the  mcch.iiiisms  .iiul  the  lound.ilioii  m  hnguisiic  ilieors,  there 
aic  i,’„ood  iunction.i!  icasoiis  loi  adopinig  the  prc.seiit  sclu-iiie.  Holloin  up  pioccssing  is 
well  suited  foi  ti.iruiliiig  1 1 .igmeiil.ii v  in|>ul.  Head  diiven  [nirsiiig  dues  not  eiu  ountei  (lie 
spill  ions  anit'iguiiN  uilkU'iU  in  more  lop  dow  n  .ijiiirviaches  that  miisi  ■■.elecl  .iiiiOiig  N  ariocN 
pliMsc  '.'iiicfu.u-  rulc^  tliai  pudici  iikoniiii;’,  siiuv'iuic  (.Al'iies,  1‘tsoj  i  [k-  implic.ili  >ns  ol 
lisiii:'  iu’.ui  dll.  -I  p, living  uidi  lic.ui  lin.il  l.iiiim.tgeN  kill  I'c  dc.ih  wnh  m  (  lia.piei  S 
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Figure  3. 10:  Checking  constraints  for  proposed  links.  Each  syntactic  link  may  evoke  a  number 
of  independent  constraints, 
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riori-argunicnl  ( i  e  ,  non-posili(mally-niarked  relations)  moditica'.iori  Adjuncts  arc  assumed 
to  be  ('homsky-adjOiricd  (Chonisk  ,,  ld86),  meaning  adjunction  lesulls  m  tlie  creation  of  an 
additional  node.  Figure  3. 1  1  gives  an  exainpieofan  ad)oiiied  structure.  The  relations  ad'/om 
and  iu’iul  are  used  to  (Jistiiiguish  the  structure  from  the  basic  specitiei.  complement, 

.ind  head  stiucturcs  .Adjunction  is  assumed  to  be  uniform  m  that  .uiy  phiasal  level  (zero, 
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Figure  3.12:  Traces  in  syntactic  structure. 

NL-Soar  generates  trace  elements  as  tt  does  any  structural  relation:  with  the  link 
operator.  The  coindexatiori  iS  handled  :n  the  current  system  by  establishing  a  pointer  to  the 
antecedent,  though  it  could  be  handled  by  copying  the  relevant  features  from  the  trace’s 
antecedent  onto  the  newly  established  node. 

The  proposal  of  link  operators  that  create  traces  is  tr  iggered  by  the  presence  of  potential 
antecedents.  There  is  no  need  to  wait  for  the  “surface  position”  of  the  trace,  contrary  to 
(Pickering  Barry,  1991).  In  particular,  these  proposals  test  the  spec-CP  position.  The 
proposal  conditions  for  traces  in  verb  complement  oosition  look  like: 

(<state>  'assrgiiers-spec-CP  <cp> 

'asKigners- comp-V'  <vl>) 

';.;j)ec  <rip> 

head . comp . head . comp . head  < v 1 > ) 


Hr-  altr;buic  luunc  will;  the  dot  liolation  is  simply  a  shorthatid  for  following  along  a  path 
oi  imiltijilc  sliiRiiiral  links. 

I'his  nici.  Iianisni  handles  arbitfaniv  long  distaiiCi.-  dependencies  as  v\  ell 
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lire  I  o  hieak'.  dosci,  she  long  distance  iel:itit>iislu[)  into  a  cliain  ot  stiojl  link*-.  In-tk  'cn 
ifUc!  nu'di.ilc  !; .u  c  ckai leii.'s  .3  mii’. ei s.il  Is icaiitv  const i.uui  ;  sii’oi.icein.  \  !  cusuo  s  duit  c.k  h 
i"ik  111  the  cihun  IS  esi.Uuislicd  o\ei  a  loc.d  ’■trciion  ot  the  iri'e  I  fius.  tlu*  fixed  to,  ai  p.itteni  ', 
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Figure  3.13:  Phrase  slructure  for  a  long  distance  dependency.  The  relationship  between  who  arid 
the  object  of  draft  is  established  by  a  chain  of  local  relations. 


Single  path  and  de.structive  repair 


Nl.  Soar  is  a  single  path  compreherider.  This  is  in J  an  additional  assurnplion,  but  one  that 
f  ollows  iialuraily  f  rom  the  models  assumption  (69)  and  a  basic  <architectU!  al  feafurc  of  Soar 
(the  single  slate  principle).  Fhc  derivation  goes  as  loilows: 

I .  Problem  space  states  are  annotated  models  (itic  models  assumption) 

.9  Models  rcj:iieseiit  one  situation  (;i  l>asic  proptalv  n!  niodcis) 

V  Piofriein  space  states  represent  one  situation  (tioni  1  am!  2) 

I  Soar  maintains  <(  single  state-  per  ai’live  [irotilem  s[iaee.  pr-viou-,  state-,  arc  mu  awn! 
aliie  to  hacki.ip  to,  (tiie  single  stale  piiiieifile  t  Newetl,  I99'()i) 

b  I  heietoK',  N’l ,  Soaa  is  a  siiiL'le  palti  ciimpielieiidc-i  ( tium  '  amt  4  i 
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Actvialiy,  the  argument  is  not  quite  as  tight  as  this.  There  are  two  ways  ol'  si  ipping  in  some 
characteristics  of  a  multi-path  comprehender,  so  that  the  claim  is  weaken  d  somewhat', 
h'irst,  the  identification  of  problem  space  states  as  annotated  models  may  be  given  a  w'eaker 
interpretation  by  allowing  single  states  to  contain  more  than  one  rnodel^,  thereby  repre¬ 
senting  more  than  one  situation.  In  fact,  we  will  .see  below  how  NL-Soar  peirnits  some 
momentary  limited  parallelism  in  violation  of  the  strict  one  pure  model  assumption.  Of 
course,  one  cf  the  basic  findings  of  research  into  the  nature  of  mental  models  is  that  it  is 
very  difficult  to  manipulate  multiple  models  (Johnson-Laird,  1983;  .loimson-Laird,  1988). 

Second,  the  annotations  m  annotated  models  violate  structure  correspondence  by  defi¬ 
nition  and  may  therefore  permit  representations  of  multiple  situations  with  a  single  model. 
For  example,  one  annotation  explored  in  the  work  on  syilogistic  reasoning  (Polk,  1992) 
IS  optional,  which  means  that  the  annotated  object  may  or  may  not  be  in  the  represented 
situation.  However,  the  semantics  of  annotations  are  con.strained  such  that  they  must  be 
interpreted  with  respect  to  a  local  piece  of  the  model.  Arbitrary  scoping  domains  cannot  be 
established,  w'hich  would  begin  to  approach  the  power  of  hr,st  order  logic. 

Even  granting  these  possible  exceptions,  it  seems  clear  that  uniformly  adopting  the 
models  assumption  in  Soar  leans  to  a  comprehension  theory  that  tends  strongly  to  a  single 
path  model.  Interestingly,  the  re,striction  to  a  single  model  per  state  is  equivalent  to 
eliminating  arbitrary  copy  on  demand,  at  least  in  the  ca.se  of  the  uttei  ance  model.  If  multiple 
syntactic  interpretations  are  permitted,  then  at  an  n-vvay  branching  point  (local  ambiguity),  n 
complete  copies  of  the  existing  rnodel(s)  must  be  produced  to  generate  the  new  independent 
models,  (As  noted  earlier,  structure  sharing  violates  .structure  correspondence).  Thus,  the 
single  path  assumption  can  also  be  seen  as  a  way  of  dispensing  with  an  arbitrary  copy 
mechanism. 

The  critical  functional  question  that  the  single  path  a.ssumption  gives  ri.se  to  is;  What 
happens  when  the  incoming  input  is  inconsistent  witlt  the  chrisen  structure? 

Consider  the  ca.se  of  local  lexical  ambiguity  m  (72): 
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arrives,  NF^  and  AP  nodes  are  projected,  and  the  determiner  the  is  attached  in  spec-NP 
position,  forming  the  NI'  the  square].  Then  table  arrives  and  is  projected  to  NP.  Next, 
the  adjective  phrase  [^p  square]  is  adjoined  to  (a/-  table].  Each  syntactic  link  is  vv'ell-formed. 
but  two  mutually  incompatible  bits  of  structure  have  t^een  produced,  since  the  single  token 
square  cannot  simultaneously  be  an  adjective  and  a  noun. 

Such  local  inconsistencies  cannot  be  allowed  to  persist  and  propagate.  They  would 
eventually  lead  to  complete  functional  breakdown  because  the  model  representation  does 
not  systematically  support  multiple  interpretations.  To  repair  the  structure,  NL-Soar  has  an 
additional  operator,  snip,  which  breaks  a  link  previously  established  by  a  link  operator, 

Figure  3.15  shows  how  snip  wor  ks  to  repair  the  structure  in  Figure  3.14.  The  snip  opet  - 
ator  is  immediately  triggered  by  the  presence  of  syntactic  structure  attached  to  competing 
senses  of  the  same  lexical  token.  Preference  is  given  to  the  more  recent  structure,  so  snip 
breaks  the  spec  link  between  the  determiner  and  [^p  square].  Next,  a  link  operator  attaches 
the  determiner  in  .specifier  position  of  [.v/>  table],  forming  jjvp  the  square  table]. 

Snip  is  the  minimal  amount  of  new  mechanism  required  to  effect  repairs.  In  fact,  snip 
does  not  complete  the  repair,  it  just  destroys  a  bit  of  structure  and  then  lets  the  link  operators 
take  over  to  finish  the  JoD.  Snip  is  in  the  class  of  repair  mechanisms  cafied  simple  destructive 
repair  (Lewis,  1992).  It  works  on  the  existing  structure,  without  recourse  to  any  previously 
held  states,  and  the  repair  itself  is  accomplished  by  the  existing  constructors. 

Consider  another  kind  of  momentary  inconsistency  that  does  not  arise  from  lexica! 
ambiguity: 

(73)  John  knows  Shaq  is  tall. 

Figure  3.16  shows  what  happens.  Shaq  is  initially  attached  in  the  complement  position 
of  knows.  Wfien  is  arrives,  it  is  projected  to  an  IP  and  CP  (CPs  are  p.’ojecied  in  the  absence 
of  overt  complementizers,  following  Prii  nett  (1992)).  Next,  a  Imk  is  proposed  to  attach 
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There  arc  good  computational  reasons  for  such  a  tightly  constrained  generation.  A  free 
generation  of  snip  for  every  link  in  the  utterance  model  has  two  undesirable  consequences. 
First,  it  leads  directly  to  a  potentially  large,  undiscriminated  .set  of  operators  in  working 
memory.  As  discus.sed  earlier,  such  .sets  (multi-attributes)  are  a  source  of  exponential  match 
cost  in  the  recognition  memory.  Second,  even  for  moderately-sized  syntactic  structures, 
the  introduction  of  freely  generated  snips  increases  the  .search  space  significantly. 

Thus,  this  constrained  repair  mechanism  avoids  significant  increases  in  both  knowledge 
.search  (recognition  match)  and  problem  .search  (growth  of  problem  .space).  However, 
it  provides  the  functionality  required  to  deal  with  unproblem.atic  ambiguities;  Chapter  6 
explores  the  mechanism  in  detail  on  the  57-sentence  collection  of  unproblematic  ambiguities 
and  garden  path  sentences. 

3.3.4  Chunking  new  u-constructors 

The  processes  described  above  for  constructing  the  utterance  model  are  organized  into  a 
problem  space  hierarchy  that  produces  new  u-constructor  coinprehension  operators.  The 
complete  hierarchy  is  shown  in  Figure  3. !  7.  We  will  step  through  this  Figure,  showing  how 
one  particular  u-constructor  is  constructed  while  parsing  the  sentence  John  knows  Shaq  is 
tall  We  trace  the  processing  after  Shaq  has  been  comprehended,  and  the  lexical  access  for 
is  has  just  completed. 

Learning  a  brand  new  operator  in  ,Soar  requires  learning  two  things  at  minimum:  the 
proposal  of  the  new  operator,  and  its  implementation.  The  impasse  (1)  in  the  top  space 
of  Figure  3.17  indicates  that  no  operators  are  available  to  continue  comprehension.  This 
impasse  provides  the  opportunity  to  learn  the  proposal  associations  for  a  new  operator. 

Soar  responds  to  the  impasse  by  dropping  into  the  Create-operator  space,  which  contains 
the  capability  to  create  new  symbols  (gen,syms)  designating  new  operators.  The  arbitrary 
symbol  u-consti'uctorl r  is  created,  and  an  operator  with  that  name  is  proposed  and 
selected.  Of  course,  another  impasse  (2)  immediately  occurs  since  it  is  not  yet  known  what 
this  operator  will  actually  do.  This  is  an  operator  implementation  impasse,  and  provides 
the  opportunity  to  learn  the  implementation  a.ssociations  for  the  new  o{ierator.  Fhc  first 
impasse  (!)  remains  unresolved,  since  the  operator  has  not  been  proposed  in  the  higher 
space. 

Soar  responds  to  tliis  irujiasse  by  dropping  into  the  IJ  (  onsinict  space,  which  r  ontains 
the  primitive  operators  (link  and  snip)  that  build  the  utterance  model  d'he  state  is  shared 
with  the  C  reate  operator  sjrace.  A  series  of  link  operators  (3)  fne  that  project  the  verb 
is  thioiigli  iP  and  CP  nodes.  As  tiie  structure  is  Iniiii,  new  assoeiatunis  are  anlomatiealiy 
created  thiougli  chunking.  I'liese  associations  iiujilemcnt  the  new  o[)eralor.  Foi  e  aiinde, 
one  of  the  (  luniks  has  the  for  m: 
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Another  impasse  arises  (4)  due  to  the  unavailability  of  additional  link  operators.  This 
provides  the  opportunity  to  learn  proposals  for  new  link  operators.  Soar  then  enters  the 
Generate  space,  which  contains  the  basic  gene.rator  for  links.  The  generate-operator 
operator  matches  potential  assignors  and  receivers  that  are  adjacent;  thus,  the  proposal 
conditions  for  this  operator  embody  X-bar  structure.  In  this  case,  generate-operator  (5) 
proposes  a  complement  link  between  knows  on  the  assigner  set,  and  the  newly  projected 
CP  on  the  receiver  set.  A  generated  operator  is  returned  to  the  U-construct  space  only  if 
it  is  well-formed  according  to  all  the  syntactic  constraints.  Thus,  another  impasse  arises 
(6)  because  it  is  not  immediately  known  whether  the  well-formedness  constraints  for  the 
complement  link  are  satisfied. 

f'inally.  Soar  enters  the  bottom  space  in  the  hierarchy,  the  Constraint  space,  where  the 
independent  syntactic  constraints  are  proposed  and  checked  (7).  In  this  case,  the  proposed 
complement  link  must  satisfy  a  subcategorization  check  (to  make  sure  this  verb  can  lake 
a  CF’)  and  an  order  check  (to  make  sure  the  complement  follows  the  head).  The,se  checks 
pass,  allowing  the  geneiate-operator  to  complete,  which  in  turns  allows  the  link  operator  to 
be  proposed  in  the  U-construct  space  (8).  This  causes  impasse  (4)  to  be  resolved,  building 
a  chunk  which  specifically  proposes  CP  complement  links.  The  conditions  of  the  chunk 
integrate  the  independent  con.straints  checked  in  the  Con.strairit  space: 

IF  a  node  X  is  in  the  assigriers  set  indexed  by  comp- V' 
and  X  can  take  a  CP  complement 
and  node  Y  is  in  the  receivers  set  indexed  by  comp-V' 
and  Y  is  a  maximal  proiection  of  category  C 
and  Y  follows  X  and  is  adjacen!  to  X 
THEN  propose  the  link  operator  for  acoinplemem  link  between  X  and  Y 

fhe  complement  link  is  performed,  prc'ducing  more  implementation  chunks  fot  the  new 
operator  u-con.structo.rl7.  When  the  link  is  complete,  a  sniji  opeialor  is  iinmediaiely 
[uoposed  (9),  triggered  by  tlie  two  siruelure.s  eompeiing  for  the  same  position  as  deseoln'd 
eai  her  The  snip  breaks  theeoniplemerit  link  iretween  know  s  aiul  Sh.a(/.  proiiueing  adduional 
impleinentalioii  chunks  lor  u-constructoi  17.  The  resuhing  slate  s.itisfies  the  Londitioiis 
ioi  the  link  operator  that  esiahliNhes  NPs  in  siibicet  (spec  IPi  jHrsition,  fins  link  eonijiletes 
the  miplenieulatiiui  of  u  -  consti  uc.tu  t  1 7. 

I  lie  sucec>sliil  eompleluui  of  the  ii  eunstiiatoi  apiplu  ation  inggeis  the  reiKfii  trpt  raior 
o[iciaioi  (  lOi  in  the  (  reate  operaloi  sjraee  K’etnni  opciatoi  sniipfiv  jHoposcs  u- c  (me  t,  rue  tor 
III  llic  lop  spate  (Mr  I  lie  resulling  etmiik  h.is  the  loiiii: 

U  du  ie  IS  .1  node  \  in  llie  leccn'er,  set  iiulcxed  bv  head  I’ 
and  a  uiulc  V  on  tlie  .issipnci'-  sei  uidexcd  in'  (omf'  V 
,ihd  y  .('.sii.’iis  a  I  oiiipiemciit  loleloaiiNI'/ 
iild  'i  (  ,111  f.lke  .1  <  i’  coiiiplcnu'lil 

.iiid  -/  .Hhl  X  artec  111  iiiiiiil'er  .ind  \  tollons  .nid  is  .idpn  cut  to  V  .ind  / 

Ii  il  \  pi  opt  ISC  n  ■■  .  aec  I- 1  11 .  1  to- 1" 
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The  conditions  of  the  proposal  ctuinks  integrate  over  all  the  various  constraints  required 
by  the  links  and  snips  that  ultimately  comprised  the  implementation  of  the  u-constructor. 
The  chunk  is  not  specific  to  particular  words,  since  any  lexically  specific  information 
required  for  building  the  utterance  model  i;.  retrieved  by  lexical  access  prior  to  the  im¬ 
passes.  After  chunking,  the  behavior  looks  like  Figure  3.2.  The  proposal  chunk  for 
U“constructorl7  fires,  the  operator  is  selected,  and  the  implementation  chunks  directly 
apply  the  operator  without  impa,sse. 


General  features  of  u-constructors 

U-consti-uctorl7  illustrate.s  a  number  of  important  general  features  about  u-constriictors 
and  their  creation  in  NL-Soar: 

«  A  single  u-constructor  may  be.  composed  of  many  primitive  structure-building  op¬ 
erations.  The  u-constructor  collapses  into  one  operator  application  the  .sequential 
problem  solving  that  involves  generating  and  testing  potential  links  against  indepen¬ 
dent  constraints. 

•  There  is  no  fixed  vocabulary  or  number  of  u-conslnictor  types.  Any  sc<rt  of  operator 
may  be  produced,  ranging  from  fairly  simple  constructions  to  relatively  complex 
and  specific  constructions,  as  u-construc.torl7  demonstrates.  U-constructorl7 
may  be  interpreted  as  the  “project  incoming  verb  to  complement  phra.sc,  attach  as 
sentential  complement  and  reanaly.se  the  previous  NP  complcrnent  as  the  subject  of 
the  sentential  comidemeut”  operator.  Tfie  set  of  available  u  constructors  defines  die 
rccognitionally  available  syntactic  knowledge  in  die  system. 

•  rtie  Cvinterit  of  the  operator  is  not  specilied  by  explicit  data  structures  winch  .ou'sl  fic 
iniecpreletl  to  \ield  tlie  behavior,  .All  that  apfusu''  m  die  (ofi  sp.tce  is  die  geusvmed 
iianie  of  die  opeiator.  Wiial  dial  sviiiIhiI  rel-'ss  to  is  iield  in  the  iiiqilementaiion 
asso(.'ialions  in  .rei.  ognilion  memorv.  i  he  u  i  oiistrik  toi  s  inav  coi  resiaand  io  sfiecria' 
svntactic  conslrnclions.  bu:  duw  caniiol  be  viewed  ,is  expheit  [ihiase  stiueluie  rides. 

•  file  iei.i.ui  piovx'ss  IS  seamlessly  uitegralrd  as  a  nan,  ol  leeogiiiiioiia!  eoiiijircheiisHMi 
at  tiiC  top  le\ei  i.n  :  ori,et  rse,  t(.o  l7  i.s  .in  ()pe.taioi  dial  e'.oks  s  .i  i.'.pid  i iSiiiab.  sis,  liiil 
il  IS  no  dilteien!  in  knui  ilutu  any  oiher  n  i  oie  iineu -i  ih.u  j.M  iptdaie'  Un.'  lori  [Hidilein 
sp.iee. 


3,4  sHupJiosii  smodpS 

I  lie  si!  (Mi  ion  mode  i  ■  s  ■  es  .e-  bodi  a  seni.iiil  n  .nui  I'e  U-|c!  li  i:d  1 1;  j  a  e-a'  iii  al  it  ni  <  :  . '  i  i  hi  I !  m. 
seilioi  v.e,amsidt‘i  !)i  '  sii  u>  lui  e  .nui  ta  nin'iiial  n  ai  t  if  ilit'  Miii.iinai  HiOile!.  usii.'an'  .  le  in, 
‘  ilk'  av  I  sta  nan:  It  i  t'pi  estaii.il  n  ai  the  pi  liecssta. .  >i  i  ehaaail  lest  g  m  mn  .  n  t-  tiesi  i  ;i  a  a  I  in  f  '  a 
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3.4.1  What  the  situation  model  represents 

Many  semantic  theories  in  artificial  intelligence  and  linguistics  make  some  commitment  to 
a  distinguished  level  of  primitives  into  which  all  meaning  is  decomposed.  For  example, 
the  Katz  &  Fodcr  (1963)  theory  decompo.ses  meaning  into  primitive  semantic  markers,  and 
conceptual  dependency  (Schank  &  Riesbeck,  1981)  analyses  events  into  a  fairly  small  set 
of  abstract  schemas. 

An  alternative  approach,  and  the  one  taken  in  NL-Soar,  is  to  assume  that  the  ontology  is 
at  least  as  rich  as  natural  language  itself.  This  leads  naturally  to  a  lexically-based  semantics. 
The  kind  of  representation  that  emerges  from  such  an  ontology  is  in  a  sense  superficial;  it 
closely  reflects  the  particular  lexical  content  of  the  surface  form^. 

The  50,000+  synsets  of  WordNet  (Miller,  1990)  provide  one  possible  basis  for  a 
lexically-centered  semantic  ontology.  Each  synset  corresponds  to  a  set  of  synonymous 
word  senses,  and  therefore  provides  a  unique  category  label.  Although  NL-Soar  does  not 
systematically  exploit  WordNet  (or  any  other  ontological  resource),  WordNet  exemplities 
the  kind  ontology  assumed  in  N1  .-Soar'*’, 

We  further  assume  as  part  of  the  ontology  a  decomposition  of  situations  into  objects, 
relations,  and  properties.  The  semantic  domains  for  each  of  lire  tiiree  basic  elements  arc 
unrestricted — anything  drawn  from  the  rich  ontology  is  poicsible, 

3.4.2  How  the  situation  model  represents 

The  siluation  model  is  an  annotated  model,  in  keeping  with  the  basic  imxlcls  assumption 
in  Nl,  Soar  (69),  Tlius,  the  situation  inoclci  consists  of  oh|eets,  propertu's,  and  relations 
that  map  directly  onto  tin.  objects,  properties,  and  rela'ions  in  the  leprc.sented  siiiuitioti. 
Figure  .ViS  shows  iiow  the  model  is  realized  in  attribute- value  structures  m  working, 
memory  fins  example  illustiates  a  few  basic  points  about  Nl  So.u  s  situation  models: 

•  I'liere  is  no  obligatory  pomil.o  e  dc'eonipi'Mlion,  l  or  example,  ilic  profie;  t\  ol  liemg 
a  ..  winch  could  In-  liccoiiiposeil  mto  sometiimg  like  iinttuir  t  icd  is 

lel.imed  as  a  lust  cLiss  ontt'logiv  al  ■.  .ilei'oix  ,  Ifiis  is  ima  a  rcllci  tion  ot  the  lexic.ilix 

based  smu.uilKs  diseiissed  aho\e.  I  lieie  is  no  limit  to  hovx  speedie  llie  semantic 

latieis  niav  tie  lliex  w  ill  he  ,is  specilic  as  the  nlle'anee  icijinres 

»  Allluuigli  ilieie  is  Mu  (ihligaunv  ik lornposii mii.  nnae  geiieial  [noiHuties  max  he  ex 
phculx  ie|iiesenled  O'  g  ,  the /;u,'//inrc  propii  lies  ol  ami/',. -//I  i  he  lexnallx 

hased  uiUulu;",  iloes  m>i  prei/Inde  deeonipo'.nig  the  specihe  v  .ileeoiies  iniu  imne 
eenei.il  ti'atuu's  it  pi ui.'i.'ssmt'  ikmiaiui'.  it 

1  iu’ir  i  •  'O  'll  10  i '  'x  '  I  M  ‘1 1  'I'  16  .tl  J  .IM-,  ( I  «|  '-.upr'l  Ik  Kil  ’•(‘in.iniK  1  v*j  d  x--.-:*Il[.-.!  K  •  1 L-  i  .N  H  I K  ’  11  .V  I  i  ,1  k  x' I  '  /  ' '  /  !  i  'v|m[ 

‘■1  ,ll  t  I 

'  i  Sh'  uio'pl  1'  -  fl  I  '!  1  Ir  \  K  .ll!  .  I  '.L-s'.l  r  'ill '  »  li  U’-.  |K  >(  I  uli'  mllf  llu'  !  I K  >  '!  j  ’>  *1  . !  (  t '  '  cl!  MU  I  K  c  .  ‘ 1  K  '  s 

•A  I  !  It  !!',  !  k-  \  h  ,it  !  i'.lk.Ml  Ir  -11  K  ||  t  v  .tlllj’lf  !!Mn\  i  *f  lIlC  UK  .if"  '1  .U  t  v  i.i'.sL's  .l-.MlIiUhl  { '  '  v  1 11 } ' !  I  ■.  6  !  io  'vM 

j  ’1  i  1 1  ')  I  '  '!  ..-U  l.ll  1 1  !  ,  V  I  .  iK  iilf  o  .ii  r  IlH'.M  II  •’  lit  \\  .  H  1 1  Net  M  5  u.  O  flu'  -.f  •.  l.t\'-..--N  ,!,1  »•  Did  MC.  -  ,il  1  i  k  i  ,  1*  U  I  i  k\ 
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(obj2  “property  pi  p2  “relation  rl  r2) 

(pi  “name  isa  “value  see2-event) 

(p2  “name  time  “value  past) 

(rl  “name  agent  “to  cbjl) 

(r2  “name  object  “to  obj3) 

(obj 1  “property  p3  p4)  (obj3  “property  p5  p6) 

(p3  “name  isa  “value  bachelorl  (p5  “name  isa  “value  bsll3) 

(p4  “name  animate  “value  t)  (p6  “name  animate  “value  nil) 


Figure  3.18:  a  possible  situation  mcxiel  for  The  bachelor  saw  the  ball.  see2-ovent  corresponds 
to  one  of  the  meanings  of  see  as  a  verb;  ball 3  corresponds  to  one  of  the  meanings  of  hall  as  a 
noun,  and  so  on. 

•  The  situation  model  as  depicted  in  Figure  3.18  doe.s  not  encode  the  iderctities  of  the 
objects.  It  Just  encodes  semantic  descriptions  which  may  refer  to  existing  entities. 
§3.5  discus.scs  the  process  of  referent  re.solution. 

■Adopting  a  mental  model-based  representation  raises  many  difricult  reprcsentioiial 
issues,  such  as  how  to  handle  quantification,  di.sjunction,  abstract  concepts,  propositional 
attitudes,  and  so  forth.  Although  these  are  difficult  problems,  they  are  not  insurmountable, 
Johnsoii-I.aird  ( 1983)  sketches  solutions  to  some  of  these  problems.  Indeed,  the  fact  that 
tiHHicls  arc  not  well  suited  to  handling  arbitrary  disjunetion  and  cuianfilicatioii  is  a  source 
of  explanatory  pow'tr  tor  the  theory  in  accounting  lor  pen'ormance  on  verbal  reasoning 
tasks.  Nevertheless,  as  with  any  theory  of  semarilic  repre.sentatu)n,  mueh  work  remains  to 
be  done  At  this  stage  in  its  development,  NL-Soar  simply  .uiopis  (he  models  framework 
w  ilhoiit  ads  anemg  it  i  urttiei  as  .i  representational  theory. 


3.4.d  ('(»iistrsictin>»  thf  .sitiiatittn  model  and  chunking  s-eonsfriictors 

I  Ir-  Mtii.ilion  modi’i  is  asscmhleil  in  ihe  ,S- evinsli net  space,  the  eountcip.ut  ot  I  I  lonsiiiic! 
ior  !.i  eoiisd iic'.o! s  fheie  ,iie  (tnee  o|ieiaIois  m  .S-(.onsIiiK  t,  eoi icsponding  in  [fic  ihice 
Iseae  enlides  in  inocl'is  i  reatc  ref, ■rent  esl.ibl i -.lies  ,i  ness  ol>|ecI,  .,'<,1  / /'.>< '/r  v/\  .tiigiiieiUs 
.III  existing  nl'iei'!  .i  ['ii'peilx,  .nul  u./d  ithition  establishes  ,i  icl.iiuni  ixciwccn  two 

oiijcels  !  (u'si.'  itpeialois  m.ip  ilueelK  timn  liu-  uttciaiK'c  nuulcl  .iiui  lc\u.'.il  (e.ituies  ii'  ihr 
Miii.ilinii  hiiiilc!  1  lu'  siiu.iliiiii  model  ciicodc.^  (he  leleicncc  nuicg'ciuienl  nuMiiins.'  ol  ilu- 
ulici  ,i!ii.  c,  11  SCI  \  t’s  .IS  inpiil  lo  I  he  icleuii.e  u'solulion  opci  ,iloi  s 

I  igiiic  '  I  o  (i.u  c-.  llic  ooii,s;i  iU'lioii  ol  [\ii;  o|  (be  smi.iiiv  n  niodid  \',  Inlc  i  Dinpicluihling 
I  ht'  .‘l/,;s;n  ■/.•/,,;/(•./  ilw  Kuuk'..  V\i'  hecin  |iisl  .ittci  d<  h’auj  li.o  hc.'ii  I'loR'cicd  iv'  .i  \  i’ 
•  ind  II'.  .iiid  i'.,/  i/’n'  .Mavu  1  h.is  in'cn  .iK.ii  tus!  ni  sul'|e>.;  posiiioii  I  usi.  .i  chmU'  HUciciii 
:  i[';-|  ,IIO-l  i'sl.tl'hshcs  .1  ictcicill  ioi  PiC  \'|'  nsUilIi  'riM  Ihs  oil!,-  du'  ,'inc.  !  ii-pu'  .Cilinl;’ 
till'  lOi'iil  vies,  1  ii  H'd  i>\  dll'  M'liK'ik'r  (’'■-.lie  ictriciii  .dso  ;'cn,' I  ,iu''-  ,1  u'li':nin'  ul.ilion 
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between  part  of  the  utterance  model  (in  this  case,  the  VP)  and  the  new  object  in  the 
situation  model.  Next,  add -property,  triggered  by  the  lexical  content  of  the  VP.  augment.s 
the  object  with  the  property  defeat4  (labeling  some  kind  of  defeat  event).  .Finally,  add- 
relation  establishes  the  agent  relation  between  the  referent  of  the  subject  of  the  .sentence 
|,v7>  the  Magic]  and  the  referent  of  the  VP.  This  completes  the  situation  model  for  the 
fragment  [//>  [  ve  the  Magic]  [i7>  defeated]]. 

New  s-constructors  emerge  in  a  manner  similar  to  u-constructor  creation.  The  problem 
space  hierarchy  for  building  s-con.structors  is  shown  in  Figure  3.20.  The  hierarchy  parallels 
the  structure  for  building  u-constructers  (with  one  important  difference  which  we  will 
discuss  in  §3.6).  Like  u  constructors,  s-constructors  are  assigned  gensym  constant  names. 
Chunking  over  the  deliberate  construction  of  the  situation  model  produces  proposal  and 
implementation  associations  for  new  s-constructors.  For  example,  the  problem  solving  in 
Figure  3  19  produces  a  proposal  a.ssociation  fora  new'  s-constructor  (s-constructor23): 


IF  there  is  a  VP  headed  by  defeat 

and  its  IP  projection  has  an  NF  in  specilier  position 
and  the  NP  has  a  referent 
7'HZ:.N  propo.se  s-const ructor23 


The  example  in  f  igure  3  Id  illustrates  |ust  the  simplest  kind  r)f  situation  model  con¬ 
struction  In  general,  building  the  situation  model  may  require  drawing  on  task-specitic 
kno'.vletlge  sources  and  engaging  in  consideiable  problem  solving  Although  such  pro 
cesses  are  not  an  implemented  p.ut  ol  NL  So;u,  the  basic  architecture  supports  them,  hhe 
additional  knowledge  sources  may  be  added  in  the  .S  coiisiruct  s[)ace.  or  in  spaces  below 
.S  consiriict  VVh.Mevei  lorni  the  knowledge  t  ikes  in  the  lower  spaces,  the  arcliileciure  (par 
ticularl\  chunking)  ensures  ihai  the  svstem  coiitinuall v  m.ikes  the  sh  It  trom  delibei sitiou 
to  rec(.>g,nition.  so  th.ii  o\er  (ime  e\eii  some  l.iiriv  i.isk  specihe  micrpiet.ition  knowledge 
nia\  t>e  bnnighi  lo  be;u  .is  ii.iii  ot  iecogiiitioii.il  compieticrision  fins  process  is  c.iileil  the 
ui.skifh  aiirii  ot  1,  ngu.igc  (I  ctimaii,  .Neuell.  Polk  A  I  ewis,  |o','’iti) 

W  hat  semanfie  features  sfionk!  It  \ieai  access  leti  i«  \  t  ? 

(  liiiitk  I  '  I  I  cle.il  I \  I  .uses  .m  mipoi  l.iiil  issii',  liH  seiiumtic  mter  ['rcl.il ion  in  Nl  .So.u  h,il 
sel  ot  seiiMi  i!  K  i  C-i!  ii  ICS  shoM  Id  be’  ii’li  ie  \ i!  b\  lex  ic.i!  ,u  a  css’  I  ’i  o|)its.il  .osok  i.ii  ion  i  i  i  .u 
lu.il  l\  (es!  ,  I  Ol  die  sped  Ik  le  x  ic.ii  iieiii  i  llie  iimntlc-.'tcd  root  i  !  hiis,  the  I'leseiii  oiiMin/.ii  lot' 

! epi  eseiils  the  e xlienie  h\  po diesis  di.ii  texu  al  .acess  pei  se  inildlK  i el i  ie  \  es  sein.uifu 
te.iimi-s  In  eSIed  the  srin.mKi  .leeess  ol  itu-  lexicon  n  .u  eoniplisiu  ii  tiisi  \  i.i  die  .aid 
pi  o  I  S' It  N  .ir  id  .hid  I  e  hit  n  '11  op,-i  .iloi  s,  .md  at  lei  eliiiiik  iiig  \  ui  the  •-  k  onsri  ii  U  >(  -.  \  it  tioiig  h 

dlls  i  \  1 1  ei  !iL'  oi  e.ni ;  .'  .li  loii  is  pi  o| Sidl  \  w  i  >  ii  le  tiu'  .iltei  u.iii  \  e  m  g.m  i  <M!  ion  d  i.u  m  !  i  i(;i  s  se  h'l  I  s 
tioiii  ,i  pi  i  .  s 'll !  pi  1  led  si'l  ,il  sici.tiilu  senses  Is  ,iIm.'  piob.ibix  wion;.'  u'  :  il’iislepo  .k\  A 
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Figure  3.20:  The  problem  spaces  for  building  s -constructors.  Create-operatot  generates  new 
operator  names,  and  S-construct  contains  the  primitive  situation  model  construction  operators. 

Kepairing  the  situation  model 

Situation  model  repair  follows  on  the  heels  of  utterance  mode.l  repair.  Unlike  the  U-constrircl 
space,  however,  there  is  no  additional  dest'uclivc  operator  in  S-cov>struct,  As  the  utterance 
inudcl  evolves,  it  niggers  new  s  conslructo".':  that  keep  (lie  situation  model  current.  When 
the  new  structure  conflicts  with  part  of  the  existing  structure,  the  older  structu.re  is  simply 
drsliuycd  as  part  of  the  process  of  establishing  the  new  piece  of  the  rHidel. 

This  minimalist  approach  to  repairing  (he  situation  model  is  iwt  guaranteed  to  remove 
all  extraneous  or  oul-o* date  striiciure  Ironi  the  situation  model.  For  extiniplc,  m  the  repair 
ol  The  .srpuif  f  rahit'  (Figure  3  1 3 ),  the  evolution  ol  the  uuerance  model  from  i ,v/.  ihe  .vew<//e j 
to  I  ve  the  j  .vr/nmej  tahle]  will  give  rise  to  two  situation  model  ofijccts,  namely,  a  .sc|uarc 
and  a  taiile  The  spiiaie  will  not  lie  deaioved  upon  creating  die  square  table,  .suice  at  the 
lew!  of  the  ^miatiou  model  there  is  nothing  ahom  (lit  table  that  is  uK<.tnip.ats!ilc  with  the 
existence  ol  (tie  Mjiian’  llov  cici,  tlic  square  will  icrnaiii  discofmev  lcd  ffruu  the  e ‘.teuilcd 
situaiioii  model,  since  the  icmaiudcr  ot  die  uilci  aru'e 's  about  the  tabic  lints,  it  ilio  sentence 
eontuiiicd  ihr  t'  ciihh'  i.\  Itw  /my-i'  hfF  inv  Ininp  roi^ni.  N'l.  Soar  would  not  hs'  ea.inf nsfij 

into  thinking  that  !hf  is  too  huge 
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Although  this  repair  scheme  looks  promising,  it  i.s  still  nascent.  Further  experience  with 
extended  texts  will  be  required  to  establish  its  viability. 


General  features  of  S'Con.structors 

The  examples  of  situation  model  construction  described  above  illustrate  several  general 
features  of  s-constructors  in  NL-Soar,  many  shared  veith  u-conslructors: 


v*  A  single  s-constructor  may  be  composed  of  many  primitive  structure  building  oper¬ 
ations  (e.g.,  add-property). 

•  There  is  no  fixed  vocabulary  of  s-constructor  types.  The  variety  of  s-constructors  is  a 
function  of  the  indefinite  variety  of  structures  potentially  produced  by  the  S-construct 
space , 

®  The  content  of  the  operator  is  not  specified  by  explicit  data  structures  which  must  be 
interpreted,  fn  particular,  there  is  no  intermediate  mapping  language  between  the  ut¬ 
terance  model  and  situation  model.  Semantic  interpretation  knowledge  is  esse  itially 
held  in  the  proposals  for  situation  model  constructors  (both  the  s-constructors  in  the 
top  space  and  the  primitive  operators  in  the  S-construct  space). 

•  The  repair  of  the  situation  model  is  directly  triggered  by  the  results  of  the  repair 
process  in  the  utterance  model.  The  situation  repair  is  an  integrated  part  of  the 
s-constructor 

•  The  construction  of  the  situation  mode!  is  an  incremefital  process  that  works  t)n  pari  ial 
syntactic  structure.  S-ccnstructors  do  noi  w.ut  fot  the  completion  of  any  particida; 
synlactic  node  before  they  begin  interpretation 


3.5  Reference  resolution 

Rclercnce  lesoiulion,  like  scm.tntic  interprelaiton,  can  ailuOary  amount  ol  inicr- 
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isa  table 
.size  big 
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isa  table 
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Figure  3.21;  Res^olve  operators  recognize  aspects  of  the  situation  model  as  semantic  descriptions 
of  known  objects.  The  recognition  produces  a  constant  symbol  that  is  the  identifier  of  the  object  in 
long-term  miemory. 

3.5.1  Reference  resoludon  as  recognition 

NL-Soar  takes  reference  re.solution  to  fundamentally  be  ap.  ocess  of  recognition.  Reference 
resolution  recognizes  pieces  of  the  situation  model  as  pa.rfi.ii  de.scriptions  of  known  objects. 
The  process  augments  the  situation  model  with  constant  identifiers  which  are  symbols 
denoting  the  object  in  long  term  memory  (that  is,  know'edge  about  the  object  is  held  in 
associations  in  long  term  memory  that  have  the  symbol  as  part  of  their  conditions  or  actions). 

Thus,  there  is  no  representational  level  between  the  syntax  and  the  referenti.u!  represen¬ 
tation:  the  situation  model  is  ihe  referential  representation,  or  mc-re  accurately,  the  resolved 
situation  model  is  the  referential  representation.  This  organization  contrasts  with  muiti- 
ievei  models  such  as  READER  (Thibadeau  et  al.,  1982),  which  posit  independent  levels  of 
semantic  and  referen/ial  encoding.  In  effect,  the  situation  model  starts  as  a  semantic  (sense) 
encoding  and  evolves  into  a  referential  encoding. 

F'igure  3.21  shows  a  simple  example,  resolving  die  noun  phrase  the  big  table.  Suppose 
there  are  two  tables  in  the  discourse,  only  one  of  which  is  big  The  big  table  has  the  LTM 
identifier  t42,  and  the  other  table  is  t37.  First,  a  resolve  operator  is  applied  to  the  new 
.situation  model  object,  instantiated  for  the  properly  isa  vable.  This  operator  t' aggers  two 
associations  in  LTM  that  retrieve  t37  and  ■':42  as  identifiers  of  objects  that  satisfy  this  part 
of  the  description.  The  situation  model  object  is  augrn£,rited  with  both  identifiers. 

Next,  a  resolve  ap  rator  is  applied  to  the  object  with  respect  to  Ihe  property  size  big. 
This  operator  triggers  a.i  association  that  recognizes  s;ize  b;ig  as  a  pronerty  of  t42.  Since 
i.37  i.s  not  so  recognized,  it  is  removed  as  an  augmentation  of  tde  object.  The  result  is 
the  uciitiue  identification  of  the  .square  tahL  as  t42..  Had  this  part  of  the  description  been 
insufficient  to  uniquely  identify  the  refereni,  both  identifiers  would  have  persisted,  awaiting 
for  incoming  rnatcri.il  to  pi.re  down  flic  .set  te.g,  a  po.st  ’!omiiial  f'F'  like  in  the  bedroom). 
Tims  kind  of  increnicntai  aj.iproach  is  t  iinihir  to  the  one  u.sed  in  .SHRDLl;  (Winograd,  1972) 
(:a'c  also  (.AJtniann  &  S>eedniari,  1988)). 


3.5.2  BoOdiiip,  F«ct'igtijtson  chrinks 


Vdlicic  do  iiiese  jelenar  e  n::solm;on  associations  come  troin  '  Figure  3  22  iliusdates  what 
liafi'iCiis  wlxsi  ih  -  r''-s(,.|ve  opt  ra’or  ?s  applied  to  the  is.i  iablo  propeiij,  .,nd  associations 
isii  !on,Hiii”ve  any  tdaccr  idenufiers.  A:i  inipicmentation  impasse  aoscs  and  121.  Soa;  lakes 
this  as  a  sinnal  dial  tlie  |srese)‘!l  diVi-iiphoM  mtioduccs  <i  new  iibjem  mto  the  ;hseoui'S!,s  ini' 
Miirsisse  pftr, ades  ‘iie  oppoi1i;nu\-  to  iearn  new  recogntlion  chunks  lor  ihc  description.  So;n‘ 


umikiiii  ii 
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Figure  3.22:  Learning  new  reference  re, solution  recognition  chunks.  When  a  new  piece  of  the 
situation  model  is  not  recognized,  an  impasse  arises  on  a  resolve  operator.  In  the  Assimilate  space, 
the  system  learns  to  recognize  the  novel  structure  by  associating  its  components  (properties  and/or 
relations)  with  a  newly  generated  constant  identifier. 

drops  :into  the  Assimilate  space  and  creates  a  new  constant  identifier  to  associate  w'ith  the 
new'  object.  Next,  an  assimilate  operator  is  applied  to  the  attribute  and  value  pair  associated 
with  this  property.  Based  or.  the  recognition  of  the  property,  the  new  identifier  is  returned 
to  the  higher  space.  The  result  is  a  chunk  of  the  form: 

IF  applying  the  resolve  operator  to  a  situation  object 
(7.5)  with  respect  to  the  property  isa  table 

THEN  the  identifier  of  the  object  is  t37 

Similar  processing  produces  the  chunks  that  associate  properties  w'ith  established  ob¬ 
jects.  For  example,  associaung  big  with  the  object  t,37  produces  the  chunk 

IF  ijp{)iying  the  resolve  operator  to  a  siluation  object 

witii  respect  to  the  prooerty  s  ize  big 
( 76)  .  - 

and  the  object  has  identifier  t37 

FHHN  t37  is  recognized  as  having  this  property 

■Such  chunks  can  be  used  to  narrow  down  (he  referent  set  as  described  above. 

rills  Lise  of  c  hunkH,g  provides  a  clear  example  of  cliunkiiig  a.s  knowledge  level  aequi 
sifion:  specduj)  learning  is  not  a;  issue  licre.  The  system  is  not  .simidy  learning  lo  rei.ogni/e 
somcfhint!  faster,  'I'he  recognition  chunk’s  existence  encodes  the  tact  that  the  system  has 
seer,  the  obiecl  lictore  i Roseniiioom,  .Newell  ei.  i.,aird.  i‘)ril  i. 

Recogiiiiti/n  thunk  17,5)  ts  overgeiicra!.  M  cliunks  like  this  wcu,  aiw.iys  produced, 
ideritilrei .lor  all  lire  ever  eneourilered  b,'  tlie  system  would  be  retrieved  wlicn 
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resolving  table.  Additional  contextual  discrimination  is  required  to  restrict  the  pool  of 
potential  referents  to  a  manageable  set.  The  discourse  segment  stack  of  Grosz  &  Sidner 
( 1986)  is  one  kind  of  mechanism  that  uses  the  .structure  of  the  discourse  itself  to  provide  the 
necessary  discrimination.  In  NL-Soar,  this  is  partially  realized  by  having  the  recognition 
process  associate  the  new  identifier  not  only  with  a  particular  property  or  relation,  but  also 
with  some  representation  of  the  current  discour.se  segment.  If  the  discourse  segment  is 
represented  by  a  constant  identifier  on  the  state,  then  chunk  (7.6)  would  look  something 
like: 


IF  applying  the  resolve  operator  to  a  situation  object 
with  respect  to  the  property  isa  table 
and  the  discourse  segment  is  s98 
THEN  the  identifier  of  the  object  is  t37 

This  is  just  barely  a  beginning  of  a  theory  of  discourse  structure  in  NL-Soar,  and  it  raises 
additional  questions,  such  as:  How  are  the  discourse  segment  symbols  retrieved?  How  are 
transitions  made  from  segment  to  segment?  While  a  theory  such  as  Grosz  and  Sidner’s 
provides  some  help,  the  implementation  within  the  constraints  of  Soar  is  nontrivial.  For 
example,  one  cannot  simply  directly  incorporate  the  stack  structure.  Soar’s  recognition 
memory  is  not  a  stack. 

3.5.3  Long  term  memory  of  content:  reconstructive 

The  process  of  reference  resolution  in  NL-Soar  has  an  important  side  effect:  the  recognition 
chunks  constitute  a  long  term  memory  cf  the  content  of  the  discour.se.  The  memory  is 
recognition-based,  not  recall-based.  NL-Soar  does  not  perfectly  memorize  the  content  of 
each  discour.se.  It  cannot  recall  arbitrary  parts  of  the  discourse  ba.sed  on  arbitrary  cues. 
Rather,  memory  retrieval  is  necessarily  a  reconstructive  process  in  which  other  knowledge 
sources  must  be  used  to  cornpo.se  structures  that  trigger  the  recognition  chunks. 

A  simple  example  will  illustrate.  Suppose  Soar  has  comprehended  the  utterance  The 
iiop  is  a  Spaniel.  Reierencc  re.solution  will  build  two  chunks  of  she  form 

(78)  (a)  rsa  dog,  >  o2.3 

(b)  o2.3,  breed  spanrej  >  recognized 

Su|ipose  at  a  later  time  Soar  must  answer  the  question  What  breed  is  the  tioi;''  Assuming 
that  retcreiit  resc'lution  correctly  identifies  tlie  the  day  as  .Soar  must  retrieve  the  breed 
associated  with  o23. 

The  problem  is  iliai  chunk  ( /.Sb)  is  not  a  recall  chunk  In  other  words,  it  does  not  map 
lo  le  1.  Soar  must  (,‘ngage  in  a  geneialeand  test  m  an  allem|U  to  ingnei  the  cliunk. 
f  igure  v,.’.)  stiows  c>[ie  tn)ssit)|e  reah/alion  of  the  process  .An  impasse  ariscss  (  I  )  when  the 
ausutu  to  Ihe  (|SK‘slinn  is  not  inifiKsli.alely  retrieved  .Soai  enteis  a  s(i;ice  Ic.  leconslruc!  the 
.uisvvei,  .1.1  .1  '.i.'Mcs  ol  iesol\e  oficralois  that  generate  plansitvle  candidates  loi  (he  dog’s 
breed  !  b.iscd  on  g;eiiei;d  knowledge  a.bout  clogs),  f.ach  icst.ibe  opcualoi  is  ol  die  same  lorm 
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Figure  3.23;  Recall  by  reconstruction.  The  system  is  attempting  to  ren-ietnber  what  breed  a 
particular  dog  is.  The  method  is  generate-and-test.  Plausible  candidates  are  generated  via  general 
knowledge  sources,  and  the  correct  answer  triggers  the  recognition  chunk. 

as  the  resolve  operators  used  during  comprehension:  each  associates  a  particular  property 
(in  this  case,  a  dog  breed)  with  a  particular  object  (in  this  case,  a  situation  model  object 
identified  as  o23).  When  the  correct  choice  (spaniel)  is  generated,  chunk  (78b)  fires  (2)  and 
the  goal  is  achieved.  Spaniel  is  returned  as  a  result  of  the  problem  solving  (3),  building  a 
chunk  of  the  form 

(79)  o23,  breed?  spaniel 

As  a  result  of  the  reconstructive  problem  solving.  Soar  now  has  the  knowledge  available  in 
a  new  form;  a  recall  chunk. 

This  generate-and-test  behavior  is  pari  of  the  solution  lo  the  data  ch.unking  problem 
in  Soar.,  which  is  essentially  the  problem  of  using  chunking  to  store  away  declarative 
knowledge  in  a  usable  form  in  the  recognition,  memory  (Newell,  1990;  Ro.senbloom  et  ah, 
1991).  The  succe.ss  of  any  recoivstructive  process  depends  on  finding  good  generators  to 
limit  the  search  (e  g.,  the  space  of  dog  breeds  used  above),  'fhe  genera!  problem  may  seem 
intractable,  but  a  number  of  successful  Soar  .sy.stems  have  been  implemented  which  use 
iioine  form  ol  recognition  ba.sed  memory  and  gencraie-aiid  test-based  letrieval  (Lehman  & 
(’onati,  199  V  Vera,  Lewis  ^  Lerch,  1993;  Huffman,  1993)  In  the.sc  models,  the  key  has 
been  lo  use  the  extenial  envnoument  orcurreiu  problem  solving  coiiiext  in  combination  with 
task  I'roblem  s['aces  to  constrain  the  generation  })roccss.  1  he  smiple  system  described  m 
i  Vera  el  al  ,  199.3)  takes  instruclions  for  rising;  an  automated  teller  machine  and  reconstructs 
tlie  instiui  tions  based  on  ihe  machine’s  altordances.  I'tie  sysK.'rn,  of  Hutlman  (  1993),  v\'}nch 
takes  uistiucaious  tor  a  rohot  arm  manipulating  a  blocks  world,  aeluallv  uses  NL  Sirar  tor 
die  iKiiniai  iangiiage  com[H.)iK‘[il  .ind  leconslructs  tlie  lUMiucfions  honi  p^eL:!.^ely  the  kind  ol 
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reference  resolution  chunks  described  above.  The  reconstruction  is  guided  by  the  existing 
task  knowledge  in  the  system. 

In  short,  what  NL-Soar  can  recall  about  a  discourse  is  a  function  of  what  else  it  already 
knows  about  the  objects  of  the  discourse,  as  well  as  the  cues  that  might  be  available  in 
the  current  situation.  Our  practice  in  building  Soar  systems  with  reconstructive  memories 
indicates  that  when  the  reconstruction  process  is  a  goal-driven,  situated  activity,  natural 
generators  can  often  be  found  to  keep  the  search  from  becoming  prohibitively  expensive. 


3.5.4  General  features  of  reference  resolution 

The  essential  features  of  NL-Soar’s  reference  resolution  can  be  summarized  as  follows; 


«  Reference  resolution  is  a  process  of  recognizing  parts  of  the  situation  model  as 
descriptions  of  previously  encountered  objects.  The  process  resolves  situation  model 
objects  to  constant  identifiers  in  long  term  memory  that  denote  known  objects.  The 
process  does  not  resolve  syntactic  phrases,  and  it  does  not  resolve  to  other  situation 
model  objects  or  previous  syntactic  phrases. 

•  The  process  is  incremental.  It  works  on  partial  descriptions  and  narrows  the  pool  of 
referents  as  information  becomes  available. 

•  The  process  builds  as.sociations  which  form  a  recognition  memory  of  the  content  of 
the  discourse. 

•  The  associations  are  fine-grained  in  nature,  so  that  the  information  associated  with 
any  particular  object  is  distributed  in  long  term  memory  across  many  chunks  (Miller, 
1993). 

•  Only  what  is  novel  m  a  discourse  is  stored,  bccau.se  opportunities  to  learn  new  chunks 
arise  only  when  recognition  of  the  content  fails 


•  Later  retrieval  of  the  eorueni  by  other  problem  solving  is  a  reconstructive  process 
m  which  existing  knowledge  sources  and  (Ire  current  situation  and  problem  solving 
context  plav  an  important  role. 

I'Ih'  CLui'.'iitl V  iniplenieiitei.1  processes  aie  lairly  simple,  and  much  work  lies  ;thcad  Ir' 
iiicr)i  poi  ate  richer  forms  of  infercru  mg  K.)  iiciernime  reference,  rhsemnse  segment  stiuclures, 
aiui  so  oil.  IfowcNcr,  .is  tlie  lisl  afrove  inakes  clear,  even  Ifiis  basic  siriKtuie  has  sonic 
inlcrr'sting  profx'rrics,  ami  its  1  uiictiunaliiv  lias  been  demotistiaicrl  in  at  least  one  apf)iic;Ui(i'.i 
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3.6  The  control  structore  of  comprehension 

What:  is  the  control  structure  of  comprehension?  This  question  is  at  the  heart  of  the 
psycholinguistic  debate  on  modularity  (though  it  is  rarely  cast  in  those  terms).  However, 
the  field  has  been  mostly  concerned  with  what  content  guides  the  flow  of  processing,  with 
less  attention  to  making  explicit  mechanistic  proposals  for  control  structure. 

NL-Soar  provides  a  well-defined  answer;  The  control  structure  of  comprehension  is  the 
open,  recogni:'.e-decide-act  control  structure  of  the  Soar  architecture  (§3.1.2).  This  has  a 
number  of  immediate  implications; 

•  Each  decision  in  language  processing  is  potentially  open  to  any  knowledge  source- 
syntactic,  semantic,  or  contextual.  There  is  no  restriction  on  what  aspects  of  the 
current  problem  space  context  may  be  tested  by  the  associations  that  comprise  the 
comprehension  operators. 

•  The  decisions  are  not  fixed  in  advance,  but  the  knowledge  is  assembled  at  the  moment 
of  the  decision  (via  the  elaboration  phase  of  the  decision  cycle).  The  control  flow  is  a 
function  of  whatever  knowledge  is  immediately  available  (via  the  recognition  match) 
at  the  time  the  decision  is  made 

•  The  control  knowledge  is  open  to  continuous  modification  via  chunking  Any  de¬ 
cision  point  in  NL-Soar  can  potentially  be  modified  if  new  associations  are  learned 
and  brought  to  bear  at  the  appropriate  time. 

In  the  next  few  sections,  we  will  explore  the  consequences  of  Soar’s  control  structure 
for  ambiguity  resolution  and  functional  parallelism. 

3.6.1  Syntactic  ambiguity 

Syntactic  ambiguity  manifests  itself  when  multiple  u-constructors  arc  proposed  in  parallel 
at  a  single  point  during  comprehension.  For  example,  consider  the  local  ambiguity  pre.scnt 
in  (80): 


(80)  rhe  doctor  told  the  patient  that . . . 

fhe  (  P  headed  by  that  can  he  attached  as  the  second  complemcnl  ol  told  oi  as  a  relative 
clause  modilying  patient.  .Suppo.se  that  the  operator  that  attaches  second  m.  ntcntiai  cornplc- 
meiUs  is  u-coiis!;ructor7  and  the  ()[)eialoi  that  furiiis  NPs  modduai  by  a  relative  clause 
is  u -conslM.ictor'2.  Then  both  ufreratuis  will  be  pro[)osed  upon  encounlei  ing  that  ni  (HO) 
(F'lgurc  3  24). 

How  should  tins  ambigniiy  he  resolved':'  fhe  Altmami,  (’lain,  and  .Stcerlman  woik 
(§2.3  ,'S)  picivides  ,1  knowledge  level  theory  toi  iesolvmg  anihu’uities  of  (Ins  l\  ju-  ihe 
iidat iiii.u.mi  Mie  slionid  depend  on  ilie  siRseess  oi  lailui''  ol  Ihe  siinpie  N4'  (in  this  e:ise. 
1',.  'll,'  [>atie>ii\]  to  imiijik  I'.'  leli'r  in  the  'nesiMit  (.ontc-st  It  the  sinipie  NP  lads  lo  seie\  t  a 
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Figure  3.24;  Syntactic  ambiguity  manifested  as  multiple  u- constructors.  At  a  local  structural 
ambiguity,  multiple  u-constructors  (two  or  more)  may  be  proposed  in  pai-allel.  Each  u-constructor 
corresponds  to  a  unique  structural  interpretation. 

single  referent,  then  choose  the  restrictive  relative  clause  attachment,  since  that  may  provide 
additional  information  to  narrow  the  set  of  referents. 

The  information  required  to  implement  this  strategy  is  present  in  the  utterance  and 
resolved  situation  models  because  reference  resolution  is  an  incremental  process  in  Nl.- 
Soar,  as  described  in  §3.5.1.  A  search  control  association  can  test  the  proposed  operators, 
the  current  utterance  model,  and  the  referential  status  of  the  situation  model  as  iollows: 

IF  operators  u-constructor7  and  U'Coiistructor2  are  proposed 
and  the  NP  in  the  assigners  set  refers  to  a  situation  model  object 
(8 1 ,1  and  the  situation  object  currently  has  more  than  one  identifier 
associated  with  it 

THEN  prefer  u~constructor2  to  u-const.ructor7 

Figure  3.25  shows  what  happens  if  this  chunk  is  in  recognition  memory  and  the  context 
IS  such  that  die  simp'le  NF  refers  to  more  than  one  individual.  Within  a  single  decision 
cycle,  [lOth  u-constructors  sre  proposed  in  parallel,  triggering  chunk  (81).  T'he  result  of  the 
de.''<.si')n  cycle  is  to  select  u  'con.';tructor2,  corre.sponding  to  the  relative  clause  reading. 

Siinil.ir  control  associalions  can  be  based  on  semantic  content,  lather  than  leferenlial 
context.  In  Chanter  4  we  will  .see  how  such  associations  can  arise  from  chunking,  and 
discu-ss  the  unplicatioms  <.>f  this  model  with  respect  to  modularity. 


3..CV.2  Semantic  ambi^^nily 

.Semantic  ambiguity  manifests  itself  when  uMdtiple  s  cunstrueiors  .sue  pioposeti  m  parallel  at 
a  single  point  dui  ing  comprehension.  For  example,  consider  the  le  .ical  ambiguny  present 
in  (82): 


( (S..’  I  ilia!  pudding  IS  I  ii  h. 

Ru  h  IS  aiii'i ji'/cued  ddh,!' nilv  dejiciidine  I’li  what  ttniig  i'  is  nioiSirvuig ,  1  liere  sc 

(Vso  senses  jnesent  in  ilic  syslein:  une  .sense  laliels  .soiiicilung  as  nrli  I'e  eat,  and  tlie 
othei  sense  labels  sonietiimg  .is  tuiaiu  laMv  wealtiiy.  SiqipOsC  th  ii  the  s  eon' n  (u.  lie  ■.  w  jin ii 
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FIGORE  3.25:  Kf“solving  syntactic  ambiguity  recognitionally.  The  structural  alternatives  (u7  and 
u2  are  proposed  in  parallel.  This  triggers  an  association  (chunk  81)  that  encodes  knowledge  about 
which  path  is  preferred. 

interpret  the  predicate  complement  construction  with  rich  are  s~constructor97  (food) 
aad  s~construc.tor44  (wealthy).  (The  set  of  s-constructors  evoked  by  a  particular  word 
do  not  riece,ssarily  map  one-to-one  to  a  set  of  senses  of  that  word.)  As  in  the  case  of  syntactic 
ambiguity,  both  operators  are  proposed  in  parallel  at  rich.  The  semantic  preference  for  one 
sen.se  over  the  other  is  captured  by  the  association; 


]/•  s-constructor97  and  s-coiistruct,or44  are  proposed 
(.83)  and  referent  of  the  subject  has  the  property  isa  food 
THUN  prelcr  s  -coiist];uctor97 

There  is  another  possibility  for  resolving  semantic  ambiguity:  build  semantic  con¬ 
straints  (selectional  restrictions)  into  the  generator  for  the  s-constructors.  In  such  a  scheme, 
c  -consti  uct;or44  would  not  be  proposed  in  (82)  because  the  rcquircruent  that  the  wealthy 
sense  ot  ncti  modify  a  person  is  not  met.  This  could  be  implemented  via  a  series  of 
constr  imt  r  hcck^  m  a  manner  similar  iss  itie  syntaefie  consiramt  checking  In  fact,  earlier 
vcisions  ot  .Nl,  Soar  did  take  such  an  approach  (l.elmian  e!  a  !‘*0|;!) 

I’hc  jirohlcm  with  ihi;-  appioach  is  ih.it  semantic  constraints  are  inherently  la  eicrentia! 
ihr  v  arc  not  alisoicie  hiteis  This  ii.is  heen  pointed  out  by  Walks  (  Id/S)  .kkI  otbci's  riie 
i.aaua at h )i:  of  n!ulti[i|c  .altei natives  aliows  knowledge  encotled  is  piefeiences  to  I'e  hsouglU 
l<r  licai.  l  iu  lliei  iiiow,  no  maitei  hi;\\  mueh  of  the  knowledet'  e.  eru  oded  nJo  the  r  eneiatoi 
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Table  3.2:  Mapping  comprehen,sion  functions  onto  problem  space  functions. 


Comprehension  function  Problem  space  function 


Parsing 

Lexical  access 

Generation  of  syntactic  alternatives 
Selection  among  syntactic  alternatives 
Construction  of  syntactic  structure 

Encoding 

Operator  proposal 
Search  control 

Operator  application 

interjiretation 

Generation  of  semantic  alternatives 
Selection  among  semantic  alternatives 
Construction  of  semantic  representation 

Operator  proposal 
Search  control 

Operator  application 

Fteference 

resolution 

Operator  application 

there  is  always  the  potential  for  an  ambiguity  to  arise  that  is  not  filtered  out  by  the  fixed 
generators.  In  that  case,  the'e  is  no  recourse  but  to  resolve  the  ambiguity  via  control 
associations  similar  to  (83).  The  general  issue  of  how  knowledge  should  be  distributed 
across  problem  space  functions  (prop)osal,  selection,  implementation  of  operators)  will  be 
addressed  systematically  in  §3.7. 

3.6.3  Functional  parallelism 

Pa.rallelism  is  an  inherent  part  of  Nl.-Soar  because  Soar’s  recognition  memory  is  a  parallel 
system.  This  paraMelisrn  ari‘'es  in  two  ways.  Ine  match  ptoccss  itself  is  massively  parallel 
because  all  associaiions  are  continuously  matched.  And,  once  matched,  the  associations 
lire  in  parallel. 

A  more  informative  characterization  of  parallelism  in  NL-Soar  is  made  possible  by 
breaking  down  the  comprehension  process  into  functional  cornporienls  and  analysing  (he 
parallelism  in  functional  terms.  Table  3.2  .show.s  the  functions  of  comprehension  and  thmr 
mapping  onto  problem  space  functions  (encotling,  operator  proposal,  oix  rator  selection, 
and  operator  application)  in  NL  Soar.  Using  this  mapping,  we  can  corrsidci  paralk-lisni 
within  I'uiictioti,  and  parallelism  (u  ross  lunctions. 


i’araHelisni  within  funetions 

Paial'ciism  within  function  relers  to  the  slirmitaiieilv  of  disiini  t  piroccsses  that  ie;ili/e  a 
panicuiar  cornfireiiciision  tanction.  In  the  top  level  \pa<,e,  e.;(  h  oi  the  tuin  I'oio  eaii  nc 
ri  ab/ed  wilhm  a  smgle  decision  cvcle.  I  lie  deciMOn  syclc  is  the  Ic.'el  ot  seiml  conliol 
.11  .Soar  VVilliiii  !!!i:  decision  cvele,  the  lecogriUioii  inenion  duo  asso.  laiiniis  m  n.nallel 
ainJ  in  -cijiicnn.’-  Ilic  jiaraneli.sni  is  limited  only  in  the  iidieicni  ilal.i  liependcncies  in 
llic  |!Ohs  Nsiii;'.  I  iins,  iiiCK'  Is  the  nutentia!  lor  (laialleliMii  in  all  ot  ilii'  v  o[ii|iit’iieiision 
Inncliuns,  In  fae!,  ihen  is  p.iialleiiMii  niilini  ail  ot  the  tuiK  in'iis,  tk  ;  iL, a!  in  the  t,'llo\>.  inn 
hsi 
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•  In  lexical  access,  the  associations  that  represent  the  different  categorial  senses  of  a 
word  fire  in  parallel . 

•  In  syntactic  and  semantic  generation,  the  associations  that  propose  the  applicable 
comprehension  operators  (u-constructors  and  s-constructors)  fire  in  prarallel. 

•  In  syntactic  and  semantic  selection,  the  associations  that  evaluate  the  alternative 
operators  fire  in  parallel.  For  example,  suppose  that  three  operators  are  proposed  U\, 
U2,  and  f/3.  The  associations  establishing  the  preferences  U'l  >  U}  and  U2  >  Vy 
may  fire  in  parallel. 

•  In  syntactic  construction,  the  primitive  sfructure  building  associations  which  com¬ 
pose  u-consiructors  may  fire  in  parallel.  For  example,  in  forming  the  noun  phrase 
Imp  the  red  block],  the  associations  which  adjoin  the  adjective  phrase  red]  to 
fw'  block]  can  fire  in  parallel  with  the  associations  which  project  block]  to  the 
maximal  projection  NP. 

•  In  semantic  construction,  the  primitive  .structure  building  associations  which  compose 
s-constructors  may  fire  in  parallel.  For  example,  in  forming  the  situation  object  cor¬ 
responding  to  the  newly  created  NP  l/v,.  the  red  block],  the  associations  e.stablishing 
the  properties  color  red  and  isa  block  fire  in  parallel. 

•  In  reference  resolution  Ihc  associations  that  recogni/o  parts  of  the  situation  model 
retrieve  identifiers  of  potential  relerents  in  parallel 


Paralleitsin  across  t'anetioiis 

Parallelism  across  functions  refers  to  the  sinuiltaiieity  o!  [Moeesses  that  reaii/e  diltereni 
comprehension  luiictions.  flie  one  at  a  time  .ipplK.iImn  ot  o[ierators  m  .Soar  iniposes 
aictiitecliiial  limit, ilions  on  tiinclioiial  paralk  lisin.  liinKcver.  iht'  senal  siicain  of  o[>e;ators 
oiilv  restricts  [i.iisillclisin  in  ojieiaior  .ip|)!icalioii  -proposal  .irul  scicciaon  m.iv  g‘,»  (Ui  in 
parallel  wilh  each  olhci  and  with  .ipplication, 

in  Nl,  .Soar,  this  inc.nis  s\ntai'(lc  slfiicline  builiiiiip,  semantic  streeliire  iniiKiaig,  and 
n'icrniiv  re, so  Ini  ion  cannot  ti.ij)|)en  in  p.n.illcl  Hui  tile  Inn 'in  mis  ol  h  -  \  estl  access,  ss  ni.icl  a 
,ui(t  scin.ailic  gcnci alii'ii,  .iiui  svntaclic  .iiul  sciii.iiilic  si'icriion  all  m.;\  h.i['|H,'ii  in  pai.illci, 
1  ir’iiic  '  .’()  sliovrs  ihia  gi  .ipliu'.ilh,  file  lioii/'oui.!l  .iri-^  icprcscnis  Iniic  1  tic  (op  ihicc 
l)0\cs  ri.prcscnl  ihc  -.cm.iIiIn  i>l  suiiclnic  I'lnlvlni;’,  and  iclrn.-iuc  icsolutuni  -Xus  vcilical 
slu  e  ihiiMiLdi  die  ticuic  icpicscnts  :i  sci  oi  link  lions  iti.il  i  .111  polcnti,dl\  h.ipj'cii  111  p.n.ilh.'l 
dm  11';'  I  lie  s.imc  iks  imomi  i.  \  cIc 

.\s  VC  i  .111  see  lioiii  iigurc  '  .’o  .ind  itu-  an.iK  sc,  ot  w  nhm  Imi  lien  p.n  allc  I  c  1 1  c  M  . 
Siiaf  s  -.uniKM  sii  luimc  \  icld-,  .1  mix  cd  senai  .ind  p.n.iliil  p-i  o>  cssme  1 '.ii  ,il.!ci!siii  \mllim 
liMK,  lion  c,  Imiilcd  oiil\  h\  iiiiiciciit  d.n.i  dcpciuiciK  ics  I'.n.dlclisin  .icio  .s  tniicm  n  i 
liciilcd  p-,  [In-  sell  liilx  o|  So.ii  s,  dccisioii  I  rclc  srlncii  icsnicl-.  ■.imiill.nicii'-  o!  .'priscoi 
.ippl  ii  .11  ion  r'cl  .11  .iu\  CIS  CM  moi  I  lent .  cci  l.im  .ispecis  1  li  -o  ni  .u  !  u  .  -NCinanl  ic  .uid  i  cl  ct  ci  M  nil 
I'l  ix  !'s  a : ! M  in, IN  he  h.ippcnmc  in  p.u.iUcI 
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Syntactic 

Semantic 

Reference 

structure  building 

structure  building 

resolution 

Semantic  generation 


Syntactic  selection 


Semantic  selection 


Lexical  access 

Syntactic 

generation 

L _ 

FiGURH  3.26;  P^iralloiism  across  (unctions  in  NL-Soar.  Time  is  along  the  horizontal  axis.  Any 
vertical  slice  through  the  tigure  cots  through  functions  that  may  happen  in  parallel 

3.7  Evalijating  a  space  of  alternative  models 

The  previous  sections  assiuned  a  particular  .structure  for  comprehension  operators:  the 
opt'raiors  come  in  three  types  (u-construciors,  s-constructors,  and  resolve  operators),  and 
the  constructors  have  an  unixunided  set  of  tokens  (u-constructor7,  u~const  ruc.toi-44, 
etc.)  Now  thtit  the  nature  of  tliis  structure  has  been  explored  fairly  thoroughly,  we  can 
step  hack  air.l  examine  the  space  of  .dternalive  comprehension  operator  schemes  and  the 
moiivatiou  (oi  itie  ('leseit;  niotlcl. 


3.7.1  'I'hp  ■  pufe 

I'hc  lollnwn!!.'  live  p.ir.irncteis  deline  a  spaci-  of  Nl.  .Roar  models  liv  s[H’cilvina  \aiious 
.ispecis  ol  eiimpieheriMon  upei.iiois  I  he  ji.ii .mielei  x.ilues  simpiv  enumerate  the  aia  liitec 
mr.tib  peimisNihle  \\.i\s  ol  le.ih/ui!',  compiehenstoii  o|x-i.ilors  flie  p.u.imete:  values  are 
nol  all  mdei'eiulcni,  .in  issue  to  be  evplu  itiv  .idiliessed  l.Uet  m  this  sei  t.on 

I’l  (.  ompieiiei isuMi  MjH'i .lU .i  tv  pcs 

fins  <  l.issi  111.  ,!i  ion  h\  oiilpui  whit  mndels  .ire  piodrucd  I  iiUeiaiKe  nwilei. 
.S  ‘.ilii.ilioii  model,  Iv  lesol'.ed  silu.ilioii  model  boi  .(  s'lveii  wioJ,  ,)ii\  luSisel  ut 
I  ij  'I'l  .ill i|  i  V  j H  s  'ii.iv  be  e V  okr’d  lo  |H"i  loi  111  eompreiu  iisn  m  i  I  he  modi.  I  i hseiisM.-d  i ii 
l  !i  Is  eli.i[  ilei  11  a  !  espoiu  Is  lo  \  .tine  i  d  1 1 
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(c)  V. . >  S  R 

(d)  . ‘  U,  -->  S,  R 

P2  Opet  ator  tokens  per  type 

If  there  is  one  comprehension  token  per  type,  then  tha'  token  is  always  selected  and 
applied  to  each  word.  If  there  are  multiple  tokens  per  type,  then  each  token  may  be 
evoked  try  various  aspects  of  the  current  coiaprehension  context.  (The  present  model 
corresponds  to  value  (b)) 

(a)  One  operator  token  per  type 

(b)  Many  operator  tokens  per  type 

P3  Detection  and  representation  of  ambiguity 

Since  true  local  ambiguity  is  unavoidable,  there  must  be  some  way  to  detect  and 
momentcirily  represent  the  ambiguity  so  that  knowledge  may  be  brought  to  bear  on 
the  choice.  The  possibilities  for  aetecting  and  representing  amlriguity  include  two 
architectural  mecharasms.  (The  present  m(.tdel  corresponds  to  vahie  (a)). 

(a)  Context  slots  (e.g..  operators,  states)  f architectural j 
tb)  Attribute  impasses  [architectural] 

(c)  Special  data  structures  [non-architectural  j 

P4  Distribution  of  knowledge  across  proposal,  selection,  and  ’mpieineritalion  of 
prehension  operators 

The  multiple  knr/wledge  sources  that  must  be  brought  to  hear  '.i  comprehension  rnav 
be  distributed  in  diffesent  ways  across  the  functic'is  of  proposing,  selecting,  and 
implementing  operators.  (The  present  model  distributes  most  of  the  knowledge  in 
proposing  and  selecting  operators;  for  example,  all  syntactic  const'-aiiii::  are  chunked 
irilo  the  operator  projrosals.  This  conc.sponds  to  vaiu-  { .;)  ) 

(a)  MoNi  knowledge  m  iniplemcnlation 
(h)  Most  knowledge  in  proposal 
(c)  Mt)sl  kinawledgc  m  p!t>po.sal  and  seleciioii 
!  1)  Distributed  across  all  llitee 

I’s  [niliai  lexical  access 

flic  initial  retncal  ol  lexictdly  specific  kiiowlet-igt  may  he  contcvi  deiteiulcin  (!ii*a\ 
nil'  1  !U!\s  leilge  to  the  geneiator)  oi  context  indejiendeiit  iniakii'i'  du-  in  iieralor 
ioiowlciiee  le.ii;),  ilii  C’-k  ii  '\'as  noted  thai  v'xpei imeiila!  exKleiite  seems  to  !a 
\oi  eoiikxt  mek  peiiiieni  aeeess  ilu"  piCse'it  model  l:..kes  diis  louie,  cories[)t  iViiiiiie 
to  .due  I  h  I ) 

■  Set  t!;;-  o;ii  •: !  iiid  el  ,il  ’  u‘)  r )  idi  .im  exphiiialiun  ol  .lent'iili;  iinjiae',  '  , 
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Table  3.3:  Dependencies  in  the  parameter  space. 


P2a-->P3b/c  and  P4a 
P2b— }  3a  and  P4c/d 
P3a-P2b  arid  P4b/c 
P3b-^P4a/d 
P3c--P4a/'d 
P4?.— P3b/c 
P4b— P2i .  and  PSa 
P4c -P2fa  and  P3a 
P4d-~F2b  and  P3a 


(a)  Context  dependent 

(b)  Context  independent 

As.  mentioned  above,  the  parameters  are  not  completely  independent — selecting  certain 
values  for  certain  parameters  hxes  the  settings  for  other  parameters.  In  particuiar,  P2,  P3 
and  P4  are  all  interdependent.  For  example,  having  many  tokens  per  operator  type  forces 
knowledge  to  be  in  the  proposal  and  selection  of  comprehension  operators  (P2b  — >  P4c/d). 
The  complete  set  of  dependencies  is  given  in  Table  3.3. 

3,7.2  The  evaluation 

The  models  in  this  space  will  be  t  valui/ed  against  three  basic  <  omputational  and  functional 
criteria: 

Cl  Transfer.  Evaluates  schemes  based  on  how  they  affect  the  specificity  fsf  ihe  chunks 
that  realize  recogiiitiorial  comprehension  in  the  top  space.  Along  this  dimension, 
schemes  tiial  build  more  general  chunks  are  rated  bcttc'  than  those  that  build  specilic 
chunk:L 

C2  Asyrnplcjtii  ejhciency.  Evaluates  schemes  based  on  the  maximum  achievable  ef- 
liciency  of  recogiiitiona!  comprehension.  That  is.  comprehension  afrer  chankiny, 
assiirniiig  cliiink'  itansfer.  7he  Ic.ss  recogmtioiial  opcratois,  the  more  efticicnl  llic 
scheme. 

t  ’  ^  (liaraett"-  af  iinph'inrnliinon.  ..ivaluaics  .scheiiies  based  on  the  s!in|)iK  ity  .‘.fid  econ¬ 
omy  of  llie  data  sinieluii'.s  aiu!  ffroce^i-sc.s.  This  is  of  cajurse  a  subjective  measure,  but 
11,  IS  iflleii  iiossilile  lo  make  -clear  (jualitaii vc  dis.iiiu  oons.  This  ciHcnc.n  esseii! lall y 
esaluates  altei fiali ves  based  cii  how  nal'iealiy  lh':v  tit  m  the  St'ai  architecture.  The 
more  lasers  ot  inechamsiii  ic.jiured  Uf  irnplcr.iriu  a  siilution,  die  h'Ss  nafuici!  fhe 
Sohll  fi  C  :  IS 
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Table  3.4:  IndepL'ndent  evaluation  of  paranieier/value  pairs. 


Cl:  Transfer  C2:  Asymptotic  efficiency  C3: 

Simplicity 

PI  a 

Plb 

Pic 

Pld 

-r 

0 

0 

0 

+ 

0 

+ 

P2a 

P2b 

P3a 

P3h 

P3c 

..c 

P4a 

P4b 

P4c 

P4ci 

-  + 

4 

P.3a 

P5b 

+ 

+  - 

Relative  evaluation:  *-  better  than  o  better  than 

No  entiy  me;ins  no  effect 


I'he  total  efiiciency  of  the  system  is  a  function  of  tlie  transfer  rate  (Cl),  recognitional 
efficiency  (C2),  and  the  efficiency  of  deliberate  comprehension.  Deliberate  comprehension 
efnciency  is  not  an  e  valuation  criterion,  since  the  schemes  do  not  differ  along  this  dimension. 

Table  .3.4  gives  ati  independent  evaluation  of  each  parameter/value  pair  with  respec*  to 
each  critei  ion.  Titis  is  a  simple  direct  evaluation,  not  taking  into  account  the  dependeri>,;ies 
noted  in  Table  3  3.  From  Table  3.4  an<i  the  dc))endencie.s  in  Table  3..’,  we  can  compute  a 
complete  independent  evaluation,  shown  in  'Tibie  3.3. 

I'ioni  f’iis  evaluation,  Pia,  Plb  and  PId  is  rrefened  to  I’lc;  P2b  is  prefciTCil  to  P2a  P3a 
js  faelcrrcd  to  P3b  atid  P3c;  P4b  and  FMc  h.  [ticfcrred  to  P-Ta  and  P4d,  We  can  recv;i!uate  the 
pvratncii'rs  in  the  rvstrictcii  soace,  fixing  tlic  pat auieter-.  P 1  =  { a,b,d } ,  P2~4.l  P3='ai.  F4  •“■{  b  c } , 
,t:  d  P.3:=  ( A.b  ] .  I'a.hie  3.b  give;,  the  rc.suil:..  'i'hc  j,)referrcd  .system  correspoads  to.  PI  ■  ■  d,  P2 
;  1),  P3  -  a,  P4  i. ,  ani.i  Pb  -  b  vx’hich  is  the  niodei  presented  in  this  cliapter,  Tlies,  the  biisse 
'-truciim-  o!  <  orojirclvvnsion  ojx  rators  and  lexica!  access  is  not  an  arbitr.iry  choice,  tin!  oikv 
g!iulc:l  tiy  dsc  funcUona!  impbeations  o.;  !niple;ncrit!r.!g  various  sees  of  nie.c!i,arns.ris  wuh.i'i 
the  So,;ir  arvlnlcc  lure. 
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Table  3.5:  Evaluation  of  paiamerer/value  pairs,  taking  into  account  the  dependencies. 
Cl:  Transfer  C2:  Asymptotic  efficiency  C3:  Simplicity 


PI  a  -t-  o 

PIb  o  + 

Pic  O  o 

Pld  ~ 

P2a  ”  '  . .  ~ 

P2b  + 

P3a  . .  “  T 


P3b 

P3c 

P4a  '  '  ‘ 

P4b  -  +•  r 

P4c  +  -  + 

P4d  + 

P57 

P.5b  + 


Rc,!ativc  ev.'iluation;  +  better  tiian  o  better  than 
No  entry  means  no  effect 


Tabi.E  .3.6:  Evaluation  of  parameter- value,  pains  in  re:4rictc<l  subspace 

(.'1:  Transfer  C2:  Asymplo'jc  efficiency  C3:  Simplicity 

"Pla . . . . . f . . . . . . . 

1Mb  o  o  + 

Pld  t- 

'P4b . . . . + . 

i^-tC  +  o 

!\3:i  •» 

pSb  I 

fb.Mauve  e’>aina!ii)n  r  beiier  (ban  lieuci'ihan 
No  calf  V  mcasis  no  c!let.  i 
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3.8  Sionniary  of  the  theory 

The  following  is  a  summary  of  the  basic  principles  of  NL-Soar.  No  attempt  is  made  in  this 
list  to  clearly  separate  the  contributions  of  the  Soar  architecture;  that  issue  is  dealt  with  in 
Cliaptcr  9. 

1 .  Comprehension  operators.  NL-Soar  comprehends  language  incrementally  by  apply¬ 
ing  a  series  of  comprehension  opeiators  to  the  incoming  input.  There  are  three  kinds 
of  operators;  u -constructors,  which  build  a  syntactic  representation,  s-constructors, 
which  build  a  semantic  representation,  and  resolve  operators,  which  perform  refer¬ 
ence  resolution.  The  constructors  may  be  composed  of  multiple  primitive  structure 
building  operations,  and  there  is  no  fixed  limit  on  the  vocabulary  of  possible  opera¬ 
tors.  Each  constructor  is  denoted  by  a  unique  constant  symbol;  the  processes  are  not 
represented  by  data  structures  which  must  be  interpreted  to  yield  behavior.  Operators 
take  on  the  order  ol  50  ms  to  complete. 

2.  Comprehension  as  a  continually  improving  mix  of  deliberate  and  recognitional  be¬ 
havior.  Given  the  real-time  constraints,  comprehension  must  proceed  mostly  by 
recognition.  When  the  required  knowledge  is  not  immediately  available,  NL-Soar 
falls  into  problem  spaces  that  carry  out  the  comprehension  functions  deliberately, 
bringing  together  independently  represented  knowledge  sources.  A.s  a  result  of  this 
problem  solving,  NL-Soar  automatically  learns  new  associ;-  tions  tint  directly  accom¬ 
plish  compreliension,  continually  shifting  Soa  from  deliber..;tion  to  recognition.  (The 
model  does  not  specify  the  top-level  associations,  only  the  lower  problem  spaces.) 

.3.  Model  represeniation  of  syntax,  meaning,  and  reference.  Problem  space  stales  in 
NL-Soar  are  annolatctl  models  (pure  models  obeying  structure  correspondence,  with 
annotations  of  limited  .scope  which  increase  the  representational  power  or  help  control 
processing)  repre.senting  one  particular  situation,  Comprehension  operators  build  two 
kinds  ol  .model  in  working  memory.  The  utterance  model  represents  the  X-bai  phrase 
structure  of  the  utferv.iKc,  The  .situation  model  represeiils  the  particulai  situaiton  that 
the  mteiance  ts  about,  decomposed  mto  objects,  properties,  and  rclalioiis  drawn  f  rom 
a  riei)  ontology. 


•1.  l.imiicd  svniactu’  irnicK  jor  unerance  model.  TliC  nodes  in  the  uUt  i.uiee  motiel  are 
indescd  iit  working  merouiy  by  tbeir  potential  syntactic  .elation:,,  in  .i  structuie  called 
the  A/R  set,  l.'.'ach  assigning  o,i  icceiving  i elation  inde;';es  at  mo.^t  Iwo  nodes.  Ail 
processes,  inciuding  semantic  inierpictation,  acc.e-,'  tlie  utleiancc  model  via  the  /UR 

sc  ! 

'V  (  ( 'uliw  i ■  indi’peiulerii,  Oii f o i tt'l  Icxicn!  u,'  cr  .v.s.  Iriiiia!  Icxic  ai  .ic  ccss  rels  le  es  all  csti'e 
;:or:,!i  senses  oi  ...i  'asucI  tn  paialiei.  iiidependeni  of  tfic'  picsenl  s\  !itaeii>e  or  '•e'nantie 
t .  I  lie  res:ilis  (.>!  ;e\ic'ai  eic  cess  are  l.i.n -level  /.ero  ranlc  s  Iiu  1:  .ne  i>lat  ed  ni  'he 
A/K  SC! 
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6.  Head-ilriven.  conslrainl-hascd  construction  of  utterance  model.  The  construction  of 
the  utterance  model  is  a  head-driven  process  which  begins  with  projection  of  nodes 
from  incoming  lexical  heads.  There  are  no  explicit  expectation  structures.  In  the 
lower  problem  spaces,  independent  syntactic  constraints  are  applied  to  check  the 
well-formedness  of  putative  structural  links.  There  are  no  explicit  phrase  structure 
rules;  syntactic  structure  emerges  from  the  interaction  of  lexical  projections  with  the 
independent  constraints.  The  generate-and-te.st  problem  solving  produces  chunks 
that  integrate  the  multiple  constraints. 

7.  Simple  destructive  repair  mechanism.  Incoming  input  that  is  inconsistent  with  the 
current  utterance  model  can  result  in  a  momentary  parallelism  of  structure.  The 
inconsistency  is  repaired  by  a  simple  destructive  repair  mechanism.  The  mechanism 
consists  of  the  snip  operator,  which  breaks  a  structural  link  in  the  utterance  model,  and 
the  existing  link  operators,  which  perform  the  reconstruction.  The  generation  of  snip 
is  highly  constrained.  It  is  only  proposed  in  two  cases:  when  competing  syntactic 
senses  of  the  same  lexical  token  have  been  incorporated  into  the  utterance  model, 
and  when  a  structur  il  inconsistency  is  detected  local  to  som.e  maximal  projection. 

8.  Reference  resolution  as  recognition  of  semantic  de.scriptions  Reference  resolution 
in  NL-Soar  is  a  recognition  process.  Resolve  operators  are  applied  to  parts  of  the 
situation  model  in  an  attempt  to  recognize  the  model  as  a  .semantic  description  of 
a  known  object.  The  content  of  the  discourse  is  held  in  long  term  recognition 
memory,  which  arises  automatically  from  an  assimilation  process  that  is  evoked  when 
recognition  fails.  Memory  for  content  is  necessarily  a  reconstructive  process  which 
attempts  to  triggei  the  recognition  chunks.  This  process  is  driven  by  a  combination 
of  the  immediate  situation  and  exi.sting  ta.sk  knowledge. 


9.  Open,  mixed  parallel/.serial  control  structure.  The  control  structure  of  NL-Soar 
is  open.  Any  knowledge  source  may  be  brought  to  bear  to  modulate  the  How  of 
processing — if  the  knowledge  is  immediately  available  in  the  recognition  memory. 
The  control  knowledge  is  open  to  continual  niodilication  via  chunking.  The  control 
structure  admits  a  mix  of  parallel  and  .serial  processing.  There  is  parallelism  w'llhin 
every  comprehension  function,  limited  only  by  inherent  data  dependencies  Theie  is 
patallelism  acioss  all  comprehension  functions,  with  the  exception  of  the  apjilication 
ot  compiehensirin  operators,  which  r)ccurs  in  a  serial  stream. 
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This  chaffer  describes  how  NL-Soar  accounts  for  some  of  the  major  phenomena  sur¬ 
rounding  structural  ambiguity  resolution.  Garden  path  effects  are  not  discussed — 
that  is  the  subject  of  Chapter  6.  Here  we  focus  on  the  processes  of  ambiguity 
resolution  per  se. 

The  review  of  the  empirical  literature  in  Chaptei  2  revealed  that  the  phenomena  of 
ambiguity  resolution  are  fairly  complex.  There  is  evidence  for  interactive  effects  across 
a  range  of  syntactic  constructions  and  context  types.  There  is  also  evidence  for  modular 
effects — the  failure  to  bring  to  bear  certain  knowledge  sources  on-line — across  a  range  of 
constructions.  Of  those  structural  parsing  preferences  so  far  proposed,  some  form  of  right 
association  and  lexical  argument  preferences  appear  to  be  the  most  robust,  in  both  linguistic 
analyses  of  corpora  and  in  behavioral  studies. 

The  next  two  sections  demonstrate  how  NL-Soar  accounts  for  both  modular  and  in¬ 
teractive  effects,  drawing  directly  from  the  structure  of  the  model  presented  in  Chapter  3. 
The  iinal  .section  summarizes  the  NL-Soar  theory  of  ambiguity  resolution,  and  draws  .some 
genera!  conclusions. 


4.1  Modular  elTects 

Modular  cflccts  in  ambiguity  resolution  can  arise  in  two  ways  in  Nl.  Soar.  First,  Nl.-Soar 
may  completely  fail  to  detect  an  ambiguity,  m  which  t  ase  knowledge  cannot  be  brought 
to  bear  to  resolve  it.  This  is  the  most  severe  bretikdown  of  ambiguity  resolution  possible, 
siriee  ihe  efferts  olten  eannot  be  overcome  w'itli  additional  knowledge  oi  experience.  I  he 
second  kind  ol  breakdown  involves  a  failure  to  bring  the  rer|U!rcd  knovviedge  to  Ireat  on 
the  ambiguity.  Hoih  kinds  may  give  use  to  a[.)[):ue!!l  sinietuia!  [irefereriees  m  a  xariety  of 
w'avs.  Is  described  in  the  next  live  sections 
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4.1.1  The  limit  of  two  attachment  sites 

The  strongest  prediction  that  the  A/R  set  makes  about  ambiguity  resolution  is  that  at  most 
two  nodes  are  available  to  assign  the  same  structural  relation  at  any  given  time.  In  a 
structure  of  the  form 


Xn 


only  two  of  the  nodes  will  be  available  to  assign  any  particular  structural  relation,  even  if 
all  n  sites  are  grammatically  open.  For  example,  consider  the  right  branching  structure  in 
(85): 


(85)  Stewart  saw  the  dog  under  the  box  on  the  table  in  the  room  next  to  the  library. 

At  most  two  noun  phrases  available  to  assign  the  adjunction  relation  (adjoin-N’)  to 
prepositional  phrases. 

Thus,  the  A/R  set  serves  a  theoretically  similar  function  to  closure  principles,  which 
predict  when  syntactic  nodes  are  closed  for  funher  attachment.  The  best  known  are  Early 
Closure  (Kimball,  1973)  and  Late  Closure  (Frazier  &  Rayner,  1982)  (§2.3.1).  Church 
(1980)  provides  an  empir=  al  critique  of  both,  demonstrating  that  Early  Closure  over¬ 
predicts  difficulty  and  Late  Closure  under-predicts  difficulty.  He  offers  the  A-over-A  Early 
Closure  principle  as  an  alternative  with  significantly  better  coverage'.  The  critical  idea  is 
that  the  two  most  recent  nodes  of  the  same  category  (hence,  the  A  over  A)  may  be  kept 
open  at  any  time.  This  is  similar  to  what  the  A/R  set  predicts,  with  the  exception  of  the 
pure  recency. 

One  way  of  directly  testing  the  theory  is  to  construct  material  with  three  potential 
sites  and  syntaciically  force  attachmeiu  to  each  of  the  three  siites  as  the  experimental 
manipulation.  NL-Soar  predicts  that  one  of  the  sites  should  can.se  difficulty,  givitig  an 
impression  of  ungraminaticality.  Recenlly,  Gibson  et  al.  (1943)  conducted  a  study  using 
material  with  three  potential  NP  attachment  sites,  and  found  that  forcing  attachment  to  one 
of  the  sites  (the  intermediate  site)  cairsed  difficulty,  .Although  this  study  is  not  the  best 
|)ossible  test  of  the  theory’,  the  same  pattern  of  results  held  in  an  analysis  ol  three-site 
Nf’-PP  attachments  in  the  Brown  corpus;  attachment  to  the  intermediate  site  occurred  pist 
14%  of  the  time  (Gibson  Sc  Pearlrnntter,  1993  ) 

I  he  theory  as  slated  does  not  |nedict  which  Iwoot  the  three  torn)  sites  will  be  av.irlahle, 
since  i!  does  not  spccil  v  any  pailicniai  strategv  foi  deteiimniiig  whu  h  node's  remain  in  the 

'  ( iiiisnn  1  1  on  I  )  Uses  .1  inoilifilsi  \ s  i  mud  i  tt  rhi ,  pi  iiu,  i|)le  (ilk-  trim  iplr  oj  1‘tfli'ieiu  t  (  h  ’.ui/c  i 
Hceausi;  itk'ie  \^.l^  u  iiuunciiCiiV  loial  amhiieiil v  aa  iiklepiTkirrit  e,inicii  path  ethsl  inav  li.io  Iki-ii 
in  \  1 1;  \  fit.  w  tiu  li  i(  nil  pi  I  .alt’s  I  hr  irilfrpiOl.iliiiii  nt  ttre  if  suits 
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A/R  set  and  which  r.Ovles  are  replaced.  One  obvious  possibility  is  to  introduce  an  explicit 
recency  preference,  so  that  the  two  most  recent  nodes  are  held  in  the  A/R  set,  but  the  data 
above  suggests  that  this  may  not  be  correct,  since  the  more  recent  intermediate  site  was 
significantly  more  difficult  than  the  initial  NR 


4.1.2  An  emergent  recency  preference 

Although  the  Gibson  et  al.  (1993)  study  indicates  that  recency  alone  cannot  account  for  the 
data,  a  genera!  recency  preference  that  can  be  modulated  by  other  factors  may  still  play 
an  important  role  (Gibson,  1991).  In  fact,  by  abstracting  away  from  the  effects  of  any 
particular  strategies  for  ambiguity  resolution  or  handling  conflicts  in  the  A/R  set,  we  can 
see  that  the  basic  structure  of  the  A/R  set  does  give  rise  to  a  kind  of  recency  preference. 
More  precisely, 

(86)  A/R  set  recency  preference:  Given  a  sequence  of  syntactic  nodes  ..  .x,,, 
n  >  2,  that  potentially  assign  some  structural  relation  p,  attachment  to  more 
recent  nodes  via  p  is  more  likely  than  attachment  to  less  recent  nodes,  all 
things  being  equal. 

This  preference  can  be  derived  with  a  simple  probabilistic  analysis.  Let  Ps{x)  be  the 
probability  that  node  x  will  be  selected  as  an  attachment  site.  Let  -<  be  the  precedence 
relation  among  nodes,  such  that  x  -<  y  means  y  is  more  recent  than  x.  Then  the  general 
statement  of  recency  preference  is 


If  X  -<  y  then  Psix)  <  Psiy)  (4, 1 ) 

A.ssume  that  X]  ,X2 . denotes  a  sequence  of  syntactic  nodes,  so  that  if  i  <  /,  x,  •<:  Xj. 

Let  Pmw(x)  be  the  probability  that  node  x  is  in  working  memory  indexed  by  some 
assigning  relation  p.  Let  Pk^x)  be  the  probability  that  search  control  knowledge  selects 
node  .V  in  the  A/R  set  for  p-attachment.  Then  the  probability  Ps  that  a  node  will  be  selected 
as  an  atlachincnt  site  is  the  probability  that  the  node  is  in  working  memory  and  selected  by 
search  control: 

/A(  u  •-  f\v\A  x)pKix)  (4.2) 

We  ab:, tract  away  fi'.mi  the  eff/xis  ol  search  control  knowledge  by  assuming 

bbr  all  noiles  gv';  Prix)  -  P,\iy)  (  !..L 

bach  lime  an  alU'nifil  is  nuule  lo  (liacc  a  new  node  in  ’lie  A/K  -et  '.mdei  some  iiulc.x  /s, 
llicr.  :s  c  pool  ot  tlnee  |>otcntial  candidates:  tlie  two  cuiienl  nieinbets  m  the  set,  aiui  iIk 
V  poiciitial  iMcnif'cr.  bet  f>e  the  pful'abilit)'  Ifial  is  ^liosen  to  ivniain  in  die  .A/R 

si't.  We  ahsliaci  awas  lioin  sMalegies  <)t  manaaining  the  /WR  set  f'\'  ..issiinnng 


roi  all  nodes  i  )  ‘  /'  \/‘*vi 
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Of  course,  if  the  stream  of  syntactic  nodes  consists  of  one  node  ,vi ,  then  Pwm{^\ )  “  1  ■ 
Similarly  for  two  nodes,  =  1,  and  Pwm(A7)  =  1-  But  at  three  nodes,  Piva/(x,)  = 

Par(-<i).  for  i  =  1. 2  or  3.  At  four  node,s  there  are  two  opportunities  to  replace  members  of 
the  A/R  set,  so  we  have 

PmAx,)  Par{x,}Par(x,)  =  Par{x,)\  for  i=  1,2  or  3;  n  =  4  (4.3) 

In  general, 

PwMiXi)  ~  Par(x,T"‘  '''~  for  j,  n  >  2  (4.6) 


From  4.4  we  have 


If  /■  <  j  and  ij.  n  >  2,P,4ft(A)'’"'^'  < 


(4.7) 


From  4.6  and  4.7  we  have 


If  i  <  j  and  /  j,  n  >  2,  PwmiXi)  <  Pwm{Xj) 


(4.8) 


From  4.3  and  4.8  we  have 


If  /  <  j  and  /  j',  n  >  2,  PwM{Xi)PK{x,)  <  Pwm(Xj)Pk{Xj)  (4.9) 


From  4,2  and  4.9  wc  have 


If  i  <  j  md.ij,n  >  2.  Psix,)  <  PsiXj)  (410) 

which  is  the  recency  preference  (86). 

This  result  only  holds  for  nodes  that  are  competing  for  the  same  structural  index  in 
the  A/R  sec  Verbs  do  not  compete  v/ith  nouns  for  PP  adjunction,  nor  do  complemeni 
attachments  compete  with  adjuncts.  This  is  consistent  with  the  lact  tliat  Rigat  AssociatK.)n 
is  .not  a  good  predictoi  across  syntactic  categories,  o^betv^ee^  argLmieni/adjurict  ambiguities 
(Abney,  !9S9).  That  is  why  the  A  over  A  Closure  Principle  di.scussed  above  .s  formulated 
in  terms  of  nodes  of  the  same  category.  Ibis  is  borne  out  i:i  the  Whittemore  &  F'er’ara 
( 1990)  study  ul  PP  .ittacliiiK  nts,  where  Right  Association  was  ioundlo  lie  most  effective  in 
arbitrating  noun  noun  and  verb  verb  attachment  ambiguities  nor  accounted  foi  try  lexical 
argument  soucture. 

To  imierale,  this  analysis  neilhei  assumes  a  lecency  jueterence.  noi  iocs  i!  siiggesi 
raie  s'luiilri  be  iricor[.)or,iled  mio  N1 ,  .Soar,  It  is  merely  an  atlemp!  to  reveal  what  ellcct  die 
strnclurc  ot  the  A/R  set.  img.ht  luue.  on  amtnguily  lesolntion  indeponjeot  ol  die  p,utietil.it 
stratt'g.ics  iisv'd  Id  manage  ihe  eontents  oi  the  set,  or  to  perforn  amlisgimy  resoinlion  itself, 

I  be  demonsir.iiuni  shews  tiuit  an  .ii-ipinent  ici'ent  v  jiieferenee  emeige.s  as  a  basic  t).rop:,'ii\ 
ol  die  binited  A/K  sel. 
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4.1.3  Object/subject  and  other  “act  vs.  do-nothing”  ambigoities 

The  kind  of  local  ambiguities  that  emerge  in  parsing  are  a  function  of  the  particular  parsing 
algorithm;  different  parsing  schemes  may  exhibit  different  kinds  of  local  ambiguities  (Ab¬ 
ney  &  Johnson,  1991).  The  head-driven,  bottom-up  process  in  NL-Soar  sometimes  shifts 
the  detection  of  ambiguity  to  a  point  later  than  the  earliest  possible  point  that  the  ambiguity 
could  be  detected.  This  means  that  at  the  earlier  point  knowledge  cannot  be  brought  to  bear 
to  resolve  the  ambiguity. 

Consider  local  object/ subject  ambiguities  such  as  (87): 

(87)  Bob  knows  Susan  went  to  the  store. 

The  earliest  possible  point  that  the  ambiguity  may  be  detected  is  at  Susan.  However, 
detecting  the  ambiguity  at  this  point  would  require  positing  the  complement  phrase  for 
which  Susan  can  serve  as  the  subject.  Since  NL,-Soar  projects  phrases  from  their  heads,  the 
complement  phrase  will  not  be  created  until  went  arrives — too  late  to  affect  the  attachment 
choice  of  Susan. 

The  only  ambiguity  that  exists  at  Susan  is  a  choice  between  attaching  Susan  as  the 
object,  or  doing  nothing.  Given  NL-Soar 's  control  structure,  such  a  choice  is  no  choice  at 
all.  When  one  alternative  is  generated,  the  outcome  of  the  decision  procedure  is  to  proceed 
with  that  alternative  without  further  deliberation.  Thu:s,  NL-Soar  exhibits  a  preference  for 
objects  in  object/subject  ambiguities.  “Preference”  is  '.  erhaps  a  misnomer  since  the  system 
is  not  even  consideiing  the  alternative. 

The  preference  for  objects  in  such  ambiguous  structures  is  well  known  in  psycholinguis¬ 
tics  (.Hakes,  1972;  Frazier  &  Rayner,  1982;  Pritchett,  1992).  The  preference  is  generally 
detected  in  leading  time  .studies,  where  subjects  show  an  increased  reading  time  in  ambigu¬ 
ous  sentences  such  as  (  87)  over  unamlriguous  controls,  In  the  severe  ca.ses,  the  preference 
can  even  lead  to  a  garden  path  effect  (Chapter  6;  Pritchett,  1992). 

riiis  knul  c!  etfcci  may  arise  in  other  structure:',  as  well.  Consider  the  ambiguity  in  (88), 

( hec'u  will  initially  he  taken  as  an  NP  complement,  which  turns  out  to  l>e  correct  for  (88a), 
but  uiconect  for  (88b)  (iliougli  uo  garden  (lalb  effect  arises;  see  Cluqiter  6), 

(88)  (a)  1  like  green. 

( b)  1  like  green  Martians. 

Ills  pos'.ible  to  oven  ide  tins  e  fleet  m  Nl.  .Soar,  but  it  req  lines  a  deliberate  attempt  to  force 
an  impasse  in  the  piocessmg  where  none  woiiiri  »»ther\\'ise  exist,  i'liis  can  he  aceomphsheo 
1)V  1(11  iiuilaling  an  (O/'/;)  ;/  aluinatiic  oper.iloi  (perh:ips  Ja  mH/iuiv  operaloi.i,  wtiich 
wouiil  c'.msf  iic  niipasNC  with  the  -.cxistine  ope'ator,  I  hen  searc  h  control  assocnitions 
imghl  be  icainid  llial  axoid  taking  Uie  exi.^li  ig  path.  .Si,u.h  a  st  heuK.-  ctO'ctixely  (dates 
NJI  Soai  in  a  <  mcful  <  n’o/lf,  in  xvtih  it  cat  h  ilecision  is  re  t  xaliialeti  ''Vc  w  d! 

stt'  stioi  !l\  ihat  ihi ,  ioikl  ot  pioit'ssing  niigjil  t>e  uselul  ni  t  fi  t.an  siscs,  hm  it  iciiiiiol  bic  the 
liclaull  V'  av  ot  pci  loi  iimig  icai  link"  ct'siiirirchcn'  ion. 
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4.L4  Time  pressure:  failure  to  bring  knowledge  to  bear 

At  any  given  point  in  time,  knowledge  in  Soar  is  distributed  across  the  problem  spaces  in  a 
variety  of  ways  as  a  function  of  the  experience  of  the  system.  The  knowlecige  required  to 
perform  some  ambiguity  re.soIution  may  not  be  available  in  a  form  that  permits  real-time 
comprehension.  Consider  again  the  familiar  main  verb/reduced  relative  ambiguity: 

(89)  The  car  examined  . . . 

Ambiguities  of  this  type  can  sometimes  be  resolved  by  appeal  to  semantics,  specitically, 
whether  or  not  the  subject  of  the  sentence  is  a  plausible  agent  for  the  verb.  In  (89),  it  is  more 
likely  that  the  car  was  being  examined  rather  than  doing  the  examining.  If  a  search  control 
association  exists  that  tests  the  appropriate  semantic  features  and  prefers  the  u-constructor 
corresponding  tc  the  reduced  relative  interpretation,  then  this  knowledge  can  be  brought  to 
bear  on-line  during  the  comprehension  process. 

However,  nothing  guarantees  that  such  an  association  will  be  available.  If  the  knowledge 
is  present  only  in  the  lower  problem  spaces,  there  may  not  be  enough  time  to  perform  all  the 
inferences  necessary  to  make  the  right  selection.  (Recall  that  comprehension  can  proceed 
at  an  average  rate  of  about  four  operators  per  word).  Under  press  of  time,  there  may  be 
no  alternative  for  the  system  but  to  .select  one  interpretation  randomly  or  by  some  default 
preference\  In  any  case,  NL-Soar  is  behaving  in  a  modular  fashion  since  the  required 
knowledge  sources  are  not  applied  on-line. 

NL--Soar  makes  predictions  about  the  kind  of  ambiguities  and  contexts  that  will  tend  to 
create  modular  effects.  The  more  specific  the  relevant  knowledge  sources,  the  more  likely 
NL-Soar  will  fail  to  bring  them  to  bear  c  '-line,  because  specific  knowledge  sources  are 
less  likely  to  have  been  chunked.  1’he  more  general  the  knowledge  sources,  the  less  likely 
modular  effects  will  ar  ise,  since  it  is  more  likely  that  NL-Soar  will  have  encountered  the 
situations  necessary  to  learn  the  required  associations.  Thus,  we  should  expc'd  interactive 
cfi'ccts  bascil  on  lexical  seinanlics  to  be  more  pervasive  than  interaction  with  particuiar 
aspects  of  referential  contexts. 


4 A. 5  Mitskinu  cffVtTs:  liii^ui.stic  IjuMollnng 

The  dll lercntial  di.stnlnith.M!  ol  knowledge  across  [rioblvun  spaces  can  give  rise  to  aiiothei 
knui  t)t  citcct  that  is  independent  o!  icai  time  issue:,.  .Sornetmics  knovvledge  m  one  sp;iee 
masks  I’ne  presence  ol  kiiowle  Igc  avaihihle  in  lo-vei  spaces,  bec.iii.sc  the  knowledge  in  tlic 
otiiei  spaces  has  no!  vet  been  ehunked  into  a  loiiii  lluu  makes  it  imnieiiiatdv  .nailable  m 
ihe  liig’her  space,  if  the  inmiediaiely  available  knowledge  is  siitficieni  to  pincecil  wiili'iui 
impa-.sc,  then  ihe  svsiem  mav  iic\'ei  .leii-ss  tiie  acitlil'ciiiai  knowledge  iHvaiise  impasses 
[HO',  ide  ilie  o(  l  asnm  lo  do  so  I  Ins  n.  known  a-  the  niuskiih’  I'iicoi  in  aililn  lai  mtelligeiiee 
(  i.iiiihe  iV  i\ost‘iihlo('m.  I'tOi),  and  it  is  l.'Uilv  per\.r.i\'e  ni  .Soar.  I  oi  ex.inLple,  oike  Sum 
ii  1 1 : 's  a  I  ),n  I  ten  i.ir  secji  leins,'  i '!  moves  that  mi.  'eeiis  m  .i  game  m  pe  .■/  ie .  it  .\  d !  tend  lo  ,  /  /m  ;  /  \ 
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follow  that  sequence  in  future  situations,  regardless  of  whether  the  sequence  is  the  most 
efficient.  The  knowledge  to  explore  alternative  paths  may  exist  elsewhere  in  the  system, 
but  the  learned  associations  continue  to  guide  it  down  the  path  initially  learned. 

In  psychology,  this  is  known  as  Einstellung  (Luchins,  1942):  the  application  of  a  lecrned 
behav  ior  in  new  situations  in  which  the  behavior  is  not  necessarily  useful.  NL-Soar  predicts 
that  there  is  linguistic  Eins\e\[ung,  in  which  the  application  of  aspects  of  linguistic  skill  may 
actually  interfere  with  the  functional  demands  of  comprehension. 

Linguistic  Linstellung  can  arise  in  several  ways  in  NL-Soar.  The  presence  of  already- 
learned  u-constructors  and  s-constructors  may  mask  the  fact  that  an  alternative  interpretation 
exists,  because  the  alternative  corresponds  to  an  operator  that  has  not  yet  been  chunked.  This 
is  easiest  to  see  in  the  case  of  u-constructors.  Suppose  that  the  system  is  in  a  state  such  that 
a  Li-constructor  exists  (say,  u-constructor41)  to  attach  a  subject  to  an  incoming  verb,  but 
has  not  yet  learned  a  u-constructor  for  the  reduced  relative  construction.  Then  in  sentences 
such  as  (89),  the  ambiguity  will  not  even  be  detected  when  the  verb  arrives.  .At  examined, 
the  proposal  association  for  u-con3tr-jct.or41  will  fire,  and  that  u-constructor  will  be 
selected  since  there  are  no  available  alternatives.  The  occasion  to  learn  the  alternative 
construction  mu.st  come  via  other  linguistic  input  (perhaps  unambiguous  input),  or  else 
through  .some  more  deliberate  means  of  forcing  an  impasse  as  di.scussed  earlier. 

Of  course,  it  is  unlikely  that  adult  comprehenders  would  be  missing  u-constructors  for 
any  but  the  most  rare  syntactic  construction.s,  or  perhajis  for  constructions  encountered  in 
novel  idiolects.  S  constructors  are  more  likely  to  be  mi.ssing.  since  ihese  are  a  function  of  the 
semantics  o\  i\\c  linguistic  input.  More  general  s-constructors  may  mask  interpretations  that 
are  more  appiopnate  in  .some  specitic  context.  Tliis  would  predict,  lor  example,  that  people 
new  to  a  particular  task  environmem  with  its  own  task-specific  subh  uguage  will  iniiially 
bring  to  bear  their  existing  .'  kills  (in  liic  form  of  the  cxisliiig  ai)|'iu.ablc  s  c onsii iiclors) 
until  the  requiretl  interpretation  operators  are  biiiil  up.  Missing  s  constructors  can  also 
tiave  an  cflci  t  on  syntactic  anibiginly  resohi'ion,  since  sonit.':  downsticain  amhigintv  may 
be  resolved  as  a  liinclion  oi  lin.  .  'Miianlic  inteiiiielalion  established  so  lar. 

.Si'arch  coiiuol  kiiowlciigc  can  also  be  masKcii  v  uusniei  iigsiin  trie  mam  \ ei b.'i ciiuced 
lelati'.e  ambigiiilv  in  iS'.t'i.  Suppose  dial  a  sc.irch  control  assocKiI ion  has  been  It'anirii 
that  encodes  a  geiieiai  pietcicnce  loi  Itu  niain  veih  readiii;.',.  I'iie  geiieial  form  vil  'iich  ,is 
associalu'i!!  IS  gi'’eii  in  I'Mti. 

//  c  <  on:'. M  im  t.oi  4  I  IS  piop.osed  O'he  iiMiii  x  ei  li  cons' i  ;;c! ion  ' 

////  ,\  ('(cici  1,1  con;'. tenict. oi  ■■]  1  ,is  die  hes!  o[H‘i.tlO! 

.Snell  .1  piciciciici'  coiilii  hax'i;  been  le.iincd  m  a  siiu.iti on  wiieie  ih'-ie  si.c.  no  si'in.iiiiic  oi 
pi  an  1 1  i.il  ic  basis  loi  m.tkmg  c.  choice,  oi  in  a  sini.itioii  wiieie  ical  iimc  dcm.iiuls  did  noi 
pel  mil  .iccess  lo  die  senianlu /|iia‘’iiialie  knoM 

111  .11  i  \  inline  '-il  nal'ons  w  Ir-i  u  ■  n :  ■  1. 1  i-  s  o  i  -1  i  is  pi  op,  >'■;  d ,  clams  i  oi )  i  w  i  i  |  i,  ]  ( 
pnuliiu’  Nl  So, II  down  die  m.iin  \',-ib  p.illi  eseii  dioiicli  dicre  is  k  now  Icii;  c  n,  du  los'.ci 
■qcici".  d'..il  Miii'lil  scU'cl  :in  aliCi  nal > vc  inlei  |!ie!.i!  ion,  No  imjcisa'  .us  , as  I  he  odim  i  on 
lio|  k  iio\\  l.udcc  u-  niiukcd  bec.iijse  cliiiiik  :0(ii  peiMiils  i  ecou  i,- 1  i  u  a  i.d  iiioiudi  po-'-diix 


130 


Chapter  4.  Structural  Ambiguity  Resolution 


incorrect — behavior  to  occur.  Again,  such  behavior  can  be  overcome  with  a  delibeiate 
attempt  to  reconsider  each  decision,  but  there  is  no  way  of  knowing  that  the  decision  might 
need  to  be  reconsidered  until  it  is  too  late.  This  kind  of  masking  of  control  Knowledge  is 
the  paradigmatic  way  that  Einstellung  shows  up  in  Soar. 


4.2  Interactive  effects 

NL-Soar  operates  as  an  interactive  architecture  whenever  search  control  associations  that 
embody  non-syntactic  information  guide  the  selection  of  u-constriictors.  Section  3.6. 1  pro¬ 
vided  an  example  of  syntactic  ambiguity  resolution  based  on  referential  context.  Interactive 
effects  can  arise  in  principle  because  there  are  no  architectural  limitations  on  what  may 
be  encoded  in  the  conditions  of  control  associations.  They  are  only  limited  by  what  is 
represented  m  working  niemoiy  at  the  moment  the  ambiguity  arises.  In  the  cases  discussed 
so  far,  we  simply  posited  the  appropriate  search  control  association  to  effect  the  ambiguity 
resolution.  A  much  stronger  theory  would  explain  the  origin  of  these  associations.  The 
remainder  of  this  section  describes  how  NL-Soar  can  in  fact  learn  such  search  control 
chunks. 

As  an  example,  we  will  use  the  main  verb/rcduccd  relative  ambiguity,  repeated  below; 
(‘rl! )  Fhe  car  examined  . , . 

Suppose  tiuit  NL-Soar  has  already  learned  (lie  u-conslructors  correspondmg  to  the  main 
verb  and  reduced  relative  constructions,  and  furthermore,  lias  learned  a  genera!  prelerence 
.issociation  for  m.un  v<-rli.s  (90).  The  diiwiission  of  the  masking  effect  above  makes  clear 
that  o''e:conimg  this  preference  to  learn  the  correct  .search  control  rule  will  he  a  nontri'  ial 
matter. 

f  igure  4.1  .shows  what  happens  on  the  iiutia!  [>a:vs  through  the  .sentence  fragment.  This 
hgure  introduces  an  altcrnaine  way  ut  dhisiralmg  the  system  behavior  thai  will  prow  more 
efiicicnt  for  the  cm  rent  (unposcs  Lacti  line  beginning,  wiih  a  number  and  a  single  letter 
tollowfd  iiY  a  colon  cor respunvls  to  a  dei'ision  cycle  0  :  denotes  an  opeialor.  i’ :  denotes  a 
(tioi'lem  space,  aial  1  :  denotes  .m  nn|>asse  {  [  .and  I’  noi  shoun  ni  tins  lignie),  impasses  will 
be  nuli'iited  u>  iiuhe.ile  [inx.essmg  m  .i  siibgoa!.  Indi'ridn.ii  i  liiuif  firings  will  scinelnnes  be 
ikw  al  oil  M.  par.tle  Iiik's;  ilu’V  do  not  eoriespond  to  separ.ile  dceiMun  eveles.  iiiis  uaee  is 
s’ei  .  rated  lioni  llu'  .leliial  svstem  ouijuit. 

ill  the  miti.il  pass,  aliei  ihe  ap[>healion  ol  r;  -(.mi:; t.  fin- toi  39,  both  n  consKur  loi  s 
( i!  oil : .  t  !  111.  !.  o !  4  1 ,  1 1 1.1  in  verb.  ,ind  u  ■  i  orn; !.  rm  t  oi  'ha,  i  ediu  ed  i  el.ui  \e  i  aie  [)i  op,  sed  in 
p,ii  ,illel.  ,iiid  liie  assor  i.il  :oii  In  e-.  ih.if  pielers  the  mam  \  ei  b  u  eeaisli  in.  loi.  .A'  ex['e<  ;  1 1  ,iii 
I  lie  (Ml !  lei  ( I  isi'Mssi,  >i|  1  4  n  n:-  k  i  ae  el  b.  em,  the-.  le  ■.nils  :  n  die  selert  lo'i  ol  u  sa  n : :  t  i  lu  i  ;>  la]  i 
I  tie  ,  ( Ills!  1 11(4 Ol  lii.it  b  it  low  :.  I  i  i-alt's  i!  le  si!  ii.iiion  model  i  ibiei.  Is  i  ejnesc!  li  mg  1  tie  ail  i  i.ilM in 

m  lie  ti  llu  ( MI  la  iK 'lui'  itn  (.‘s.iiimmig  Ni  So.ii  ii.o  demoiia! i .lu  d  .t  (  l.ia  iv  niodul.ii  n \ 

>  !l( a  I 

.\  auiiic  ili.il  ihi,'  a',  aicii;  b.f,  soiti','  (  .ip.ibilils  l,i  lU'Iico  si'fn.miu:  .momaliea.  so  iti.U  .it 
'•oma  I'oiiu  ihc  (Ol  lie  ill  oi  [Ik  •auuiboi!  niodai  amiol.ibsl  .n  iniid.inMble  A  \a  ,i\  io 
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t;u 

READING  WORD;  the 

0;  u- conr,truc.t,or34 

READING  WORD:  car 

0 ;  u~ construe toi 36 
G:  s- constructor39 

READING  WORD:  excunined 

Firing  cbunk-5'f’4  ;  propose  main  verb 

Firing  chunk-488  ,  propose  reduced  relative 

Firing  pref er-main-verb 

G:  u-constructor41 
0:  s-constructor44 


Figure  4.  l ;  First  pass  through  The  car  examined.  Upon  encounteting  examined,  two  associations 
fire  in  parallel  proposing  operators  corresponding  to  the  main  verb  and  relative  druse  stmetures. 
The  main  verb  proposal  immediateiy  triggers  a  general  association  preferring  this  operator,  so  a 
the  next  decision  cycle,  the  main  verb  operator  is  selected  (u-'r.onstructor41). 

accomplish  this  in  the  present  example  is  with  an  operator  that  matches  an  inanimate  agent 
and  marks  the  situation  model  as  implausible. 

There  are  a  number  of  possible  ways  the  system  could  respond  to  such  anomalies.  One 
plausible  response  is  to  attempt  to  rccomprehend  the  input  more  careluHy.  In  reading, 
this  could  lake  place  via  regressive  eye  movements  fCarpcnler  &  Darieman,  198.1;  Just 
A  Carpenter,  1987),  in  speech,  by  appealing  to  a  short  lenn  verbatim  niemoiy,  or  simply 
asking  the  sficaker  to  repeat,  dhe  details  of  liow  it  happens  arc  not  important  here.  We 
sim|)ly  assume  that  there  is  some  deliberate  attention  that  enables  the  system  to  coiufuchend 
the  liagmem  again  l'r>  tlie  start,  buitheiniure,  it  is  not  critical  nTicri  the  decision  is  rnaclt 
to  rccomprehend  The  present  model  decides  to  rccomprehend  as  soon  as  the  anomaly  is 
dciecicd.  Othei  delay  slcaegies  are  povsibic. 

Fig, me  4  2  illustrates  this  proces.s,  starting  with  the  last  operator  from  I'igure  4  i. 
i  .hice  ilie  anmnaly  is  delected,  the  mtend  opeiaior  is  selected,  repre.sentnig  the  intention  lu 
rcpioces-  ihe  uipui  more  caiefully.  What  iiues  il  mean  tor  NL  Soar  lo  compreheml  more 
t-iucjiilly'  1  he  answer  to  tins  ijncstuin  was  skeiciicd  in  the  §4.1.3  L.)ec!Sions  that  were 
picviouslv  iiiade  wilhoul  irufja.'-.se  must  now  be  recoii.iidcrv'd 

Wilt!!  Nl,-.So:ii  ciiniprehends  caieiii!ly,  it  toices  impasses  in  situations  Wiierc  Iheo' 
i’  tiuin,-  than  oni;  r)peran):  ['ropo.'.cii,  I'hc  second  iinic  tlnougli,  iin  niijsassc’  arises  ai 
c  winniiCil.  j'crmiUing  rnoic  thorough  evaluation  of  each  alternati  ve  1  he  ii-consU  Uv  'ois  an' 
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\ 

1 

1  0;  notice-anonaly 

j  Cl;  attend 

READIMG  WORD:  the 
0:  u--constr'ictor34 

I 

READING  WORD;  ca¬ 
ll  vi-coa:3t:'uctor36 
0  s-corc'^ructorSy 

HEADING  W'l.lP;  examined 

Firing  chunk -574  ;  propose  main  verb 

Firing  chunk--488  ;  propose  reduced  relative 

Firing  pref er-main-verb 

==>I;  operate/  i i o 
P;  Selei'  +  ioi! 

0:  evaluate  (,'!-constructor41) 

==>I;  operator  application 
P;  Coir oreheus ion 
0;  u-con,  cructor41 
0;  s~  con  rt-uctorM 
0:  eval  te-situat ion-model 

Evaluation  of  i;-co,?istructor41  is  implausible 

0 ;  evaluate (u-constructor45) 

==>1;  operator  apj.)lication 
P;  Comprehension 
0;  u-constru(;toi45 
0;  s-constructor53 
0 ;  evaluate-Eituatioi'v-modeJ 

E'.valuat ion  of  u-cionst::ructor45  is  pla'isibJe 
Bui 1 d :  chunk - S97 

0  u- (X-nst  rue.  t  or4S  ;  reduced  -e.latlve 

0  ;3-  ci:>r.;rr  I  nctorbd 


I  UdiRh  4,1'  Carcii.ill>  te  s  oinfufiirudiug  /'u:  -m  i-va/nini'il.  An  u)i[>asse  toacd  .d  (hr  iiouii 
'.vheit'  (hr  ofjc'r.iti  ins  uu:  pii.'f.-o  >od,  ami  six*  dciusuin  to  sidcx!  the  niaus  verb  operator  rs  iceoii 

>u.u.!ed  [euli  operatoi  evaluateti  by  ajrplyuig  i*  ar'd  c\;;hialUiU  itit  leMtlliuj.'  siu.iatuin  luodol  fu! 
piaur.jbddy  A  [)F<4e.i'  ut  c  is  n  turned  icu  :.tie  relative  clause  ojXautor,  tie,  !iny  a  uex  chutik 
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evaluated  by  siniplv  applying  theni,  ailowiiig  the  s-constructors  to  perform  the  semantic 
interpretation,  and  then  comparing  the  interpretations,  U"Constructor41  produces  an 
implausible  evaluation  as  it  did  earlier;  u-ccnstruc'i;or45  produces  a  plausible  evaluation. 
Based  on  these  two  evaluations,  a  preference  is  returned  prefetiing  u-coxi!;-;t.riictoc45 
over  u-constrnctor41.  This  re.solves  the  original  impasse,  resulting  in  the  selection  of 
u-constructor45,  and  processing  continues  in  the  top  space  as  usual. 

I’he  resolution  of  the  impasse  produces  a  search  control  chunk  (chuiik-597)  of  the 
form; 

(F  u--constructor41  and  u-coostroctor  >^5  "-c  proposed 
and  the  incoming  verb  is  examine 
and  the  preceding  NP  refers  to  sor,'..:h.hng  iiutniinaie 
J'HE.N  prefer  n-'Constructor45  over  u-conc  .-ructor41 

This  is  precisely  the  kind  of  sernantic  association  we  assumed  could  exist  in  principle  in 
the  earlier  discussions  of  interaciivr  and  rnodular  ambiguity  resolution.  Criticall).  it  is  not 
cijndilioned  upon  ML-Soar  bei.ng  in  careful  comprehension  mode,  since  that  was  irrelevant 
to  the  problem  solving  that  produced  the  chunk  (of  course,  it  was  exceedingly  ielcvarn  to 
initiating  the  problem  solving)'*. 

Figure  4.3  shows  what  happens  now  that  tire  chunk  s  in  long  term  memory,  A'l  the 
amb'guity,  the  two  u -constructors  ta'c » 'roposed  its  u.suah  This  is  (m:mediatf;iy  followed  by  tiie 
firing  of  the  two  control  a.ssocia!,  on;r.  the  general  preference  fo'"  u  -const  r-uctorT,  1,  and  the 
semantic  preference  for  u-con,-:tri-c.tor4b  over  u-  co*i;;'t;ruc'tor4..i,  U-  co;iU:  tcaci-,i.w4b 
is  then  selected  without  futthe!  Jeliberaiicc  (because  more  specific  bina-'y  pieferences  tak" 
precedence  over  umuy  ciefercnces  in  Soar).  NL-Soa^'  now  exhibits  classic  mteractive 
e-fftets. 

ri'iis  schenie  i;;  an  instantiattou  of  lire  genera',  metnod  of  rccoveiy  from  inrorect  knowl¬ 
edge  in  boar  (Laird,  1988).  The  distin.guirhing  fieature  c-f  tiiis  recovery  it;  Soar  is  that 
ihe  sc'  ,;.e  decision  i.s  corrected  monoton  e  additior-s  to  the  lung  lervn  fnemoiy---i)o 
associiation  is  ch.ingea  or  removec.  The  orig  aa!  genci'.'ii  preference  doe:;  not  go  awr.y 
with  the  acquisition  oi  the  new  semantic  con'ro'i  assoeiatio.!!.  J  his  is  a  nccessaiy  resali  of 
the  fiiridamenta.i  a,ssun.-pti<.ia  in  Soar  that  lot.v  (.nn  menuny  is  perniarvim  and  cogfutivefy 
impenetrable  Tl,e  dei.dicrat':  nature  of  ti’e  nicth-Hl  (the  I'equircrnent  to  iorce  impasse,. s  is  a 
dir.;'  .'!  result  oi  the  I'  sisking  el  feet  noted  cai  fiv** 

Alth(.)U‘:h  '■.•compren(:.''idiiig  i.s  a  very  simple  :w;t  erne  for  error  recovery,  it  ha,-  'nimber 
of  fcamre',  'ha;  make  it  iCirly  plausibic.  Ihere  is  i  lmndan*  ev.'deni.'c  frr.nn  eye  n.uvt'meni 
studies  ioi  regressions  durmg  i  .nnfirelietision,  wid.  pauses  at  d  fficuit  ttiateria!  (lust  d. 
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READING  WGRIj  :  the 

0 :  u-constructorS^' 

READING  WORD:  car 

0;  u-construc.tor36 
0:  S"Constructor39 

READING  WORD:  examined 

Firing  chunk-574 
Firing  chunk-48S 
Firing  pref er-main-verb 
Firing  chunk -597 

0  u-const3’uctor45 
0  s -constructor 53 


;  propose  main  verb 
;  propose  reduced  relative 

;  prefer  I'elative  to  main  verb 

;  reduced  relative 


FIGI'RE  4  .3:  Comprehending  The  car  examined  after  leanuivg.  The  new  semantic  search  control 
clrunk-597  fires,  guiding  the  syntactic  parsing  down  the  correct  path. 


The  theory  of  ambiguity  resolution  descrilted  above  has  three  components:  a  set  of  func- 
In  ua!  mcchanisins  designed  to  approximate  a  knowledge-level  view  of  ambiguity  resolu¬ 
tion.  a  variety  of  ways  that  these  mechanisms  fail  to  reach  the  knowledge  level,  aiKf  the 
means  !>y  winch  some  of  tnfi,se  limitations  can  be  overcome  with  learning. 

Ml..,- Soar  is  hrsi  and  foremost  a  junctional  theory  of  langu-agc  comprefierisiori,  ius:  as 
Soar  is  |,,!  inua  ily  a  runctiona!  titeory  of  cognitive  archi'evtiae.  The  mode!  th-.v.  embraces 
what  .migfit  be  caiicd  the  Inrjwleage  level  theory  o,f  amhiguity  rei-.olufion: 

Krowledge  levellheorv  fj  ambig'iiiv  resolution:  Any  knowledot::  source  iiiay 
he  bM.ujgfii  fo  bear  lo  resolve  local  anibiginlR'K  in  language  r-omprehesi  siou. 
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a.ssocu)tion.s  thar  guide  processing,  and  these  knowledge  sources  are  brought  to  bear  on¬ 
line  at  each  decision  cycle.  We  have  seen  examples  of  NL-Soar  using  referentiai  context 
(§3.6.  i  )  and  semantic  ccntenf  (above)  to  resolve  ambiguities  on-line. 

However,  we  have  also  seen  that  the  symbol -level  mechanisms  that  constitute  NL- 
Soar  may  fail  to  perfectly  realize  the  knowledge  level,  and  this  failure  can  be  detected 
behavioral ly\  The  following  list  summarizes  the  ways  this  may  happen; 


1 .  On’y  immediately  available  knowledge  can  affect  resolution  on-line  (under  press  of 
time);  what  is  immediately  available  is  a  fimction  of  experience. 

2.  The.  A/R  set  permits  only  a  small  subset  of  the  syntactically  permissible  attachments 
to  be  detected;  this  also  leads  to  a  genera!  recency  preference, 

3.  Immediately  available  search  control  knovdedge  may  mask  other  search  control 
know'ledge  sources  presen'  in  the  system. 

4.  Immediately  available  comprehension  operators  may  inask  other  possible  semantic 
ors>ntactic  alternatives. 

5  Some  ambiguities  (e.g.,  subjecLobject)  are  not  detected  immediately  because  the 
ahernatives  emerge  late  in  head-driven  processing. 


The  h.nai  component  of  the  tlieory  is  the  role  of  learning  and  experience  in  ambiguity 
resolution.  The  example  in  §4.2  shows  how  the  symbol-level  failures  can  be  overcome 
vviih  more  carefui  reprocessing  of  the  linguistic  input,  and  bow  this  reprocessing  gives 
ri,se  to  new'  associations  that  effect  knowledge-level  ambiguity  resolution  m-line.  Thus, 
NL-Soar  not  only  explains  how  interactive  and  modular  effects  arise,  but  i  also  provides 
the  m.!chanisni  b}'  which  the  shift  can  be  made  from  modular  to  interactive  behavior. 


4.4  is  ,N.L-‘vSoiU'  modular? 

Nov\  tiau  the  tfieory  ot  ambigintv  re.soiution  has  beeii  de.scribcd  in,  .s'iuie  delail,  H  is  instruc¬ 
tive  Id  step  !>ack  and  ash'  h.  NL- Soar  rnDdidar'' 
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§3.7  that  this  basic  distribution  of  knowledge  sources  was  motivated  by  computational 
and  functional  concerns).  Semantic  interpretation  knowledge  is  held  in  the  s-construciors; 
semantic  and  contextual  search  control  knowledge  is  held  in  the  search  control  associations 
for  comprehension  operators.  Thus,,  the  large  set  of  associations  that  propose  and  apply 
u-constructors  in  effect  comprises  an  informationally  encapsulated  syntactic  module. 

We  can  gain  further  insight  into  the  issue  by  considering  NL-Soar  along  several  of 
Fodor’s  (1983)  dimensions  for  characterizing  modularity.  Newell  (1990)  did  this  in  the 
context  of  the  initial  Soar  comprehension  theory,  but  his  analysis  still  holds.  What  follows 
is  a  partial  summary  of  this  analysis  with  respect  to  the  present  NL-Soar. 

Domain  specificity.  Modules  become  highly  tuned  to  the  specific  characteristics  of  the 
domain  (in  Fodor’s  analysis,  the  possible  domains  are  perception  (vision,  audition,  etc.), 
plus  language).  Chunking  is  precisely  a  system  for  building  up  mechanisms  (chunks)  tuned 
to  specific  aspects  of  the  environment.  The  chunks  that  comprise  NL-Soar’s  recognitional 
comprehension  capability  form  a  special-purpose  system.  As  New'^ll  pointed  out,  “In 
Soar,  the  generality  of  cognition  is  not  that  every  situation  must  be  treated  generally,  but 
that  generality  is  always  possible  when  knowledge  is  missing  (impasses)  and  that  the 
construction  of  special-purpose  mechanisms  (chunks)  works  to  avoid  future  breakout.” 

Mandatory  operation.  The  linguistic  module  applies  in  a  mandatory  fashion:  one  must 
tieat  received  speech  as  speech.  The  recognition  memory  in  Soar  is  also  mandatory  and 
automatic.  For  example,  comprehension  operators  are  proposed  automatically  whenever 
their  relevant  inputs  are  present  in  working  memory.  Once  the  process  has  been  chunked, 
there  is  no  way  to  avoid  this.  The  decision  cycle  does  pro;  ide  the  level  of  control  that 
may  permit  something  different  to  be  done  with  the  input.  But  a.s  we  have  seen,  the 
masking  effei  t  in  Soar  mitigates  heavily  in  favor  of  whatever  recognitio'  J  skill  can  be 
applied  at  the  moment.  Special,  deliberate  inodes  of  processing  along  w'ith  sufficient 
practice  are  required  to  build  up  the  skills  that  might  compete  as  alternative  to  the  existing 
comprehension  operators.  Even  then,  the  comprehension  operators  will  still  be  applicable, 
so  the  emerging  behavior  will  be  a  mix  of  the  existing  and  new  skills.  Furthermore, 
ihe  encoding  productions  (assumed  to  handle  much  of  lexical  access  in  NL-Soar)  fire 
automatically  without  any  control  from  central  cognition,  so  that  only  post- lexical-access 
processing  can  be  modulated  in  this  way. 

Fast  operation.  Language  comprehension  is  a  real-time,  rapid  process  that  presses  the 
limits  ot  neural  technology.  FIk'  basic  structure  ol  Nl.-tSoar  is  luiidamentally  responsive 
to  this  constraint.  NL  Sotir  comprehends  each  word  with  a  few  60  ms  operators  per  woid. 

I  he  entire  structure  of  the  recognition  memory  in  Srair  is  designed  'o  bring  knowdedge  to 
hear  rapithy,  Chajiter  7  will  deal  with  issues  of  teal  time  immediacy  of  iiiterprclatH)n  m 
greater  detaii. 

h.joi tnation  etica;).^: .laiion.  Modiili's  access  a  hnnted  set  of  dcsi.ngiiished  knov\  ledge 
sources:  tlicv  are  not  open  to  tfie  general  inferences  of  central  cognition.  As  we  liave  seen, 
liic  cimiiks  cong'i  ising  the  piojiosal  and  appliealioii  of  u  eonsti net  )i s  access  only  syutaciic 
knowledge  In  general,  the  know  ledge  I'rouglil  to  fiear  t>v  the  reeogm'ional  eornf!rclie:isio,i) 

(  a[sil)!lii\  IS  ininted  b\  \\  hjle\er  is  encoded  at  the  mnment  in  the  cfiunks  iliat  im[)leii!ent 
tlie  tori  jr\ ei  (  C)in[)n,‘liens!iin  ojioiatois  iful  'we  ha ve  alsi i  ,sei.-n  tfiat  it  is  [)o:,sil)lc  to  [icnetrate 
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this  capability  with  the  results  of  general  cognitive  activity.  As  Newell  points  out,  however, 
“whether  Soar  would  ever  be  overwhelmed  by  a  loss  of  control  in  a  great  rush  of  cogitations 
seems  dubious.”  Arbitrary  processing  only  comes  in  to  play  when  impasses  arise.  Given  a 
highly-practiced  skill  such  as  comprehension,  and  given  the  ubiquitous  masking  effect  in 
Soar,  the  frequency  of  such  impasses  will  be  rather  limited.  And  even  during  the  resolution 
of  an  impasse,  control  is  not  lost.  Soar  does  not  have  a  subroutine  control  structure — any 
processing  is  interruptible  at  the  next  decision  cycle. 

We  arrive  (somewhat  appropriately  for  this  chapter)  at  an  ambiguity.  NL-Soar  can  be 
seen  as  having  many  of  the  critical  characteristics  of  modular  systems,  both  structurally 
and  behaviorally.  Yet,  it  also  has  many  of  the  characteristics  of  an  interactive  system,  both 
structurally  and  behaviorally. 

NL-Soar  essentially  provides  the  same  fundamental  answer  as  the  modularity  thesis 
to  the  question  of  why  certain  limitation.,  in  ambiguity  resolution  arise:  the  limits  derive 
primarily  from  a  system  structured  and  tuned  to  perform  comprehension  in  real-time. 
However,  the  routf  by  which  Nl^-Soar  arrives  at  this  answer — an  approach  concerned 
primarily  with  fun  ionality  and  positing  specific  nrechanisrns,  and  embedded  in  a  general 
cognitive  architecture — has  yielded  a  m  ich  richer  theory  than  might  otherwise  have  been 
possible.  NL-Soar  explains  modular  and  interactive  behavior  on  the  basis  of  more  general 
principles  of  cognitive  rchiteNure.  It  predicts  that  limitations  will  flow  from  functionally 
motivated  aspects  of  the  model  (Young,  1993).  It  explains  how  the  system,  through 
experience,  might  overcome  these  limitations  and  make  the  shift  from  modular  to  interactive 
behavior.  Finally,  it  addresses  in  a  deep  way  the  relationship  between  automatic  and 
deliberate  processes,  opening  the  door  .’o  a  better  understanding  of  he  relationship  o.) 
language  and  cognition  generally,  rather  than  leavii.g  central  cognition  as  an  unexplicated 
black  hole  (Let. is,  1992;  Lchma.i  et  a,.,  1993b). 

4.5  General  discussion 

This  chapter  has  painted  a  fairly  complex  picture  of  ambiguity  resolution,  but  it  all  emerges 
from  I  small  .set  of  assumptioms:  the  basic  control  structure  and  learning  mechanisms  of 
-Soar,  plu.s  die  limited  structure  of  the  A/R  set.  The  predictions  are  consistent  with  what  is 
known  about  ambiguity  resolution,  wliicli  itself  (luiiits  a  fairly  complex  picture. 

One  potential  problem  for  a  theory  of  ambiguity  re.solution  such  as  the  one  presented 
here  is  the  dilliculty  of  ftilsiticalion.  If  ambiguity  resohition  is  guided  by  any  immediately 
available  knowledge  aiurce,  then  potentially  any  result  can  be  accounted  for  by  positing 
the  rigli!  knowledge  sources.  ’I'h.ere  is  a  genuine  danger  here.  However,  there  is  a  reso¬ 
lution  tu  this  difficultv.  I  Ulmiatelv,  NL  Soar  imisl  not  be  just  a  tlicoi  v  tliat  specifies  how 
niullipli'  knowledge  sources  interact,  but  a  theory  oi’  die  aci/uisition  of  that  knowledge 
as  well.  Seetioii  1,2  sketclieil  tiic  beginmng.s  of  |ust  siu  h  a  theory.  With  an  acquisition 
ino(ici,  the  ich-vant  assocnUions  that  accompiish  anibigmty  resohition  are  not  [>os!ted  I'V  the 
lla-oiisl  Instead,  they  aiise  when  the  inoiiel  is  placed  m  jsuticulai  icannng  '■itiialions.  New 
c\|)c;  nnents  iiipi.'ht  lie  dc\  iscd  ihai  c.Kplicilly  (t  si  die  tlicorv  by  placing,  siilijccts  m  lini;u:\iii 
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training  situations,  and  using  the  traditional  pre-  and  post-tests  to  determine  the  behavioral 
changes  that  result  from  learning. 

In  any  event,  it  is  certainly  the  case  that  NL-Soar  is  able  to  make  predictions  concerning 
ambiguity  resolution  independently  of  posited  immediate  knowledge  sources.  Section  4.1 
provided  several  examples:  the  limited  sub.set  of  attachment  sites,,  the  general  recency 
preference,  the  preference  for  objects  over  subjects.  On  this  score,  NL-Soar  has  proven 
accurate.  As  mentioned  in  Chapter  2,  the  challenge  for  theories  that  make  clear  predictions 
of  strucHn  al  biases  in  every  ambiguous  situation  is  .accounting  for  the  plasticity  of  resolution 
demonstrated  across  a  range  of  C'  ntexts  and  ambiguity  types.  For  these  effects,  NL-Soar 
provides  a  consistent  account,  and  pushes  the  state  of  the  science  farther  than  any  existing 
model  by  begin:  ag  to  show  how  these  interactive  knowledge  sources  can  be  arise. 


Chapter  5 

Parsing  Breakdown  and  Acceptable 
Embeddings 

PARSING  BREAKDOWN  ON  CENTER-EMBEDDINGS  is  one  of  the  best  known  phenomena 
in  cognitive  psychology,  and  the  number  of  theories  proposed  over  the  years  to 
explain  it  attests  to  this  fact  (§2.5).  However,  it  was  not  until  Gibson  (1991)  that  any 
theory  dealt  with  the  variety  of  difficult  embeddings,  and  perhaps  even  more  importantly, 
the  variety  of  complementary  acceptable  eml:)eddings.  This  chapter  describes  NL-Soar’s 
account  of  both  difficult  and  acceptable  embeddings.  The  first  section  outlines  the  theory  of 
parsing  breakdown,  derived  directly  from  the  structure  of  the  model  presented  in  Chapter  3, 
paiticularly  the  A/R  set.  Then  the  theory  is  applied  in  detail  to  the  43-.sentence  collection 
of  difficult  and  acceptable  embeddings  presented  in  Chapter  2  Next  we  consider  how 
NL-Soar  accounts  for  the  major  qualitative  phenomena  surrounding  parsing  breakdown. 
The  chapter  concludes  with  a  brief  discussion  and  summary  of  the  results.  Since  the  A/R  set 
plays  a  role  in  explaining  all  the  major  phenomena  addressed  in  this  thesis,  a  full  discussion 
of  the  yVR  set  and  the  “magic  number  two”  v/ill  be  delayed  until  Chapter  9. 

5.1  The  NL-Soar  theory  of  parsing  hreakdowii 

NL-,Soar’s  theory  of  parsing  breakdown  belongs  to  the  class  of  archiiectural  theories  of 
l^reakdown  (§2.5),  rather  than  the  d;  -.s  of  structural  metrics  (of  course,  a  metiic  can 
be  derived  from  any  architectural  theory,  but  the  convtise  is  no*  necessarily  true).  The 
predictions  of  the  theory  primarily  derive  from  the  .VR  .set,  with  its  two  valued  syntactic 
indices. 

Recall  again  the  strutture  ol  the  A/R  set  (§3. .13),  'I’his  is  Ihc  data  structere  in  working 
iiicinory  iliat  indexes  nodes  in  the  utterance  inodei  by  Iheir  potential  syniai  tic  tciations. 
The  sei  or  syntacue  n  lation.s  eorresponds  to  X  iuu  .stici.  luial  positnjns  (spec  IP,  comp  V’. 
etc.).  I'.ach  reiation  indexes  no  more  Ibaii  iwo  nodes.  3  hu.s,  |),usiiig  brc.ikdowri  will  occur 
vvficncver  a  [.'ailiculat  .svaitactic  sliuctuie  reqiiirt's  ihai  -i  relation  nuKo.  tluci^  oi  rnoie  norics. 
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More  precisely,  breakdown  occurs  at  the  point  when  a  node  is  needed  in  the  parse  but 
it  is  not  available  in  the  A/Fl  set.  Consider  the  classic  center-embedded  object  relative': 

(93;  PBI)  The  man  that  the  dog  that  the  boy  saw  liked  ran  away. 

The  receivers  set  must  index  the  three  initial  NPs  under  spec-IP,  since  all  three  NPs  will 
eventually  occupy  subject  position; 

RhcBIVERS^  spec  -IP.  [a-p  the  man],[N/’  the  dog},[NP  the  boy] 

Breakdown  will  occur  at  one  of  the  final  verbs  (wltich  verb  depends  on  which  NP  is  dropped 
from  the  A/R  set).  Breakdown  does  not  occur  simply  as  a  result  of  nodes  dropping  f’-om 
tile  set.  In  fact,  this  happens  continuously,  without  leading  *o  unacceptability.  The  clearest 
example  is  the  classic  right  branching  structure: 

(94;  AEl)  The  dog  sav'  the  man  that  chased  the  boy  into  the  ti.ble  that  the  cop  nit. 

Ii'  (94),  the  streap'  of  NPs  cleaidy  overloads  the  A/R  set  (partic  jlajrly.  the  adjoin- N’  relation 
in  the  assigners  set),  but  no  breakdov/n  occurs  because  no  more  than  one  NP  must  be  held 
c.  any  given  time.  (Recall  the  empirical  evidence  presented  in  Chapter  4  that  suggests  only 
a  subset  of  nodes  in  a  right  branchi  ng  structur  -'s  arc  availaole  for  attachment.) 

This  IS  essentially  an  interference  theory  of  rbort-ferm  memory  ^or  syntactic  structure. 
The  capacity  of  the  A/R  set  is  not  defined  absolucd)  v.  terms  of  r.umbe.  (/.syntactic  nodes, 
but  rather  is  a  function  of  the  syntactic  content  of  w  hat  is  to  be  stored.  When  nodes  must 
be  indexed  by  the  same  syntactic  relations,  they  inte.rfer^  with  each  other,  and  the  li.xcd 
capacity  is  quickly  reached.  Nodes  indexed  by  different  relations,  however,  do  not  pres,; 
the  limits  of  the  structure.  Thus,  the  total  capacity  of  the  .A/R  set  is  a  function  of  fne  set  of 
available  syntactic  discriminators  (as,sumed  here  to  be  .X  -bar  structural  positions). 

The  distinguisliing  chaiactcnstics  of  thi.s  theory  can  be  clarified  by  comparing  it  with 
Other  models  of  short-term  linguistic  mentory.  The  theory  differs  from  content  independent 
theories  of  storage  limitations,  .such  a.s  Yngvtds  ( I960;  original  slack-based  mociel,  which 
posits  a  fixed-capacity  stack  used  for  the  uniform  storage  of  syntactic  nodes  NL-, Soar’s 

'A  fc'A'  cxampic.s  have  b('(!n  given  iii  itie  liicrature  nf  doubly  ecnier-cnibedded  object  relatives  which 
appeal  to  be  riiore  acceplable.  'ITiey  include. 

prj)  (a)  'llie  whoci  the  secrciary  we  hred  slept  with  is  a  real  lucky  .los;,  (Kiiiiliail,  l‘J7:V) 

.b)  Isn'l  It  true  that  o.aiiiple  sentences  that  people  lhal  you  know  prodoce  aie  more  likely 
lo  lie  .iCi-t.'pled','  t  l'le  Koci  k  e!  al., 

U  .i  .A  syn’.is  hof>k  that  Svi.nic  Itali  in  ihat  I  had  nirver  heard  oi  wiou.-  was  fndilisiicd  bv  Mi  l 
jircss,  (biaiik,  lU92t 

Ahhoui'li  such  senlciu.cs  h.ivc  iiecri  |ioipr,;,r.-.'!  I'S  e.’i.aniples  of  seiu  itiiic  or  fit.iyliialK  c'Ici  ts  cni  ;nkn.>'.,; 
coniriiunaiily  atiiK'a:.-.  to  he  U.ic  use  of  ,i  picnoun  iii  tfie  inosi  tieeph  enibcdded  r  lause  i/,  or,  and  o  ,■  soovc.) 
(i:vfu  (ills  it  seems  prcniainie  lo  rlasMly  'ticse  cxauiplfs  a-,  piueb'  a-ni.uiin  or  piasnss'i  ctle.,(s 

( tfavvnic  the  cone.  !  s'cnci  eii/atiuu  and  .kcouiiImi!'  to:  the  etiet  t  wiiiun  die  cm  rent  ih;  or  \  ail!  iw 
tulni  c  re  .ea.icti 


in  .c'c.s  c  >i 
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model  has  a  content-dependent  capacity.  The  theory  dTfers  from  semantic  forgetting 
theories  of  syntactic  structure,  which  posit  that  syntactic  structure  is  removed  from  short¬ 
term  memory  as  it  becomes  semantically  interpreted  (Frazier,  1985).  In  NL-Soar,  syntactic 
structure  drops  from  short-term  memory  purely  as  a  function  of  die  incoming  syiitactic 
structure,  regardless  of  what  other  processing  has  been  done  on  it.  The  theory  also  differs 
from  uniform  resource,  theories,  such  as  CC  READER,  v/hich  posits  a  single  computational 
re.murce  shared  across  all  contents  of  working  memory. 


5.2  Predictions  on  the  PB  and  4E  collection 

Tiiis  section  describes  in  detail  N.L-Soar’s  predictions  on  the  collection  of  43  parsing 
breakdown  constructions  and  acceptable  embeddings  (Tables  2.1 1-2.15).  The  predictions 
are  derived  as  follows:  if  a  construction  requires  three  or  more  nodes  to  be  indexed  under 
the  same  structural  relation,  then  the  construction  is  predicted  to  cause  parsing  breakdown; 
otherwise,  the  construction  is  predicted  to  be  acceptable.  T'his  is  the  simplest  method  of 
apply  ing  the  the  ory  since  it  abstracts  away  from  any  specific  strategies  for  handling  conflicts 
in  the  AYR  set  (i.e.,  how  to  choose  wnich  nodes  remain  in  the  A/R  set  when  three  or  more  are 
vying  for  the  same  indexical  relation).  This  method  cannot  overpredict  difficulty  because 
no  strategy  for  maintaining  the  .A/R  set  can  overcome  the  two  node  limitation.  However, 
it  may  overpredict  acceptabihty.  in  that  there  exist  A/R  set  maintenance  strategies  that 
do  not  ensure  that  the  appropriate  nodes  are  available  for  attachment.  For  example,  one 
rather  perverse  strategy  is  to  admit  <he  first  two  nodes  tinder  each  index  and  then  block  all 
subsequent  nodes.  Such  a  strategy  would  be  completely  disfunctional,  of  course.  We  will 
return  to  the  issue  of  A/R  set  maintenance  later  in  tiie  chapter. 

As  in  the  example  above,  the  predictions  will  be  illustrated  by  presenting  a  partial  view 
of  the  A./R  set  at  critical  moments  For  structures  cat  sing  parsing  breakdown,  the  A/R 
set  will  be  pre.scrtcd  at  the  point  where  one  of  the  relations  must  bear  three  values  for 
the  comprehension  to  continue  suvxe.ssfully.  For  acceptable  structures,  the  A/R  set  will  be 
pre.sentcd  at  points  where  the  A/R  .set  bears  its  maximum  load.  The  phrtise  strueiurc  tree 
'vill  often  ho  given  rc|.'rcsciiting  file  final  or  HUcuncdialc  output  of  NI .  Soar,  d’hese  tr>‘cs  are 
'jen'.'Tiaed  autom  itieaily  from  a  trace  oi  N!  Soar’s  working,  memory  and  edited  for  hieviiy. 

T  lie  predictions  are  groupetl  into  aggregations  oj  similar  stnictures,  with  both  acca.  j  '!a^>ic 
;lIUl  Oieakdo'vii  ci.iislructions  coii.'.idt'ied  in  eaeii  ‘goiijr.  A  sumniarv  table  of  the  results 
■  ippcai.s  at  ifu-  end  of  the  ■'hapli.i 

5.2J  Rigb*  1113(1  leSi  l)ra!U'hi»tj4 

I  aiga  an-uMuts  ■>(  nglit  hi  aueiimg  (D.Ji  are  acceptable  as  mdicaU'd  ahow,  shri^  i'i('aail\ 
jii'.i  oee  liinie  (liie  mosi  l'.■('ellI)  nin-a  Is-  .available  loi  atlaeluiieui  lius  ilieie  i-.  uuai  to 
die  sici  v  I’ui  'is  >ip  nai  .sei '■  ell:  ( ruatei  dit ! it  ii li \'  vwlli  i  igli!  br  am  hiiii,’  i  ( 'lit ai i  ,k 

<v  MiMe;.  pgiti  v.lix  .i'.r.  su  i  tiu  lasid  ilriM'u,  tiolioiii  en  sa,iU‘;.'\  ut  \l  .Soar  ’  I  ii  ' 
p'  ( ihi'' II !  i  -  vs  I'h  !i  ui a lOiia I  bi  .lioiii  up  rsii  st'i  s  ui.a  i  ap.iud  phi . i  a,'  s' :  i.u',  iii  ,■  i  nles  mil'll  l.u  !e 
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Figure  5.  l ;  Growth  of  right  branching  structures. 


NP  NP  PP 
NP  NPS’ 

With  a  right  branching  .structure,  tiie  parser  must  keep  open  every  node  for  possible  further 
bottorn-  up  reducticai.  f'or  example,  in  John  saw  the  hook  on  the  table,  the  NP  the  table 
cannot  he  removed  Iroin  consideration  for  further  expansion,  since  the  sentence  may  con 
tinuc  in  the  room.  This  in  turns  means  that  the  preposition  <)n  must  remain  in  consideration, 
since  it  may  need  to  attach  to  a  new  NP.  Simdarly.  the  book  may  need  to  attach  to  a  new 
PP,  and  so  on  unbounded  right  branching  structures  lead  to  unbounded  requirements  for 
buffering  partial  copsdtucnts. 

NT  .Soar  avoids  this  because  it  is  not  clu'osmg  from  a  set  of  phrase  structure  ruies  to 
expand.  A  inodilieil  NP  may  be  iormed  by  simply  adioming  to  an  existing  NP.  bigiire  .S.l 
sliow.s  {lart  of  the  increMicnlal  growth  ol  the  utterance  model  tor  (d4)  There  is  no  need 
to  delay  attachment  decisions.  The  only  limit  that  can  arise  is  in  the  luimher  of  (lolenti.il 
altachnical  sues,  an  issue  addressed  above  am.!  in  ('haptci  T 
l.eft  blanching  striK.'tine.s  aie  ,i!so  .u'ceptable  if'ignrc  0.2): 

pt-S;  Ai  .?)  Her  sons'  emploxee.'.'  contributions  .  .  . 

Ihs  .iii.'-c  .NI  .So.ii  IS  In  I  lid  nil,'  I  he  liee  bon  can  ui;,  Kdt  branching  sii  le  nnes  au'  casi  l\  fiandh  d 
1  (  h.nri  ,k\  tV  .Millei ,  I  do  ' ) 
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Figure  5.2;  Left  branching. 

5.2.2  Center-embedded  relatives 

A  single  center-embedded  relative  clause  can  be  parsed  because  just  two  NPs  must  be 
buffered  in  the  spec-IP  (subject)  relation  of  the  receiver’s  set  (phrase  structure  in  Figure  5.3). 

(96;  AE;3)  The  dog  that  the  cop  saw'  is  big. 

Receivers  soec-'P:  the  cop],[j^r  the  drig) 

As  we  saw  earlier,  tw'o  such  einl  iddej  relatives  (PB!)  leads  to  breakdown  because 
three  NPs  must  be  available  or:  he  'pec.-iP  relation.  !l  is  irrelevant  whether  the  overt 
complementizers  arc  prc.sent  or  n  it,  1  >th  :.tructurcs  arc  predicted  to  cause  difliculty: 

(97;  PC2)  'I  hc  man  the  wmnaii  the  dog  bit  likes  eats  hsh. 

An  objeci  lelativc  may  he  embedded  in  a  VVli  qu'  stion  vviilioiil  causing  bieakduwn: 

(bS,  At  4)  W'iia!,  did  the  luan  that  Mary  likes  eat  ' 

In  (b'H),  die  NP  |\7’u7i(i/|  is  .iiiaciicil  immcdi.itelv  ni  s[iec  t'P  [nisiliou,  as  ui  i•^gliIe  b.-l, 
so  docs  not  coniiibiitc  to  mlerterence  on  the  specdP  relation  In  fact,  |,v/>  the  rnati]  is  also 
attached  nnmediatelv,  occn;  ving  s[iec  IP  position  o!  the  IP  pi(i|ee!etl  tii'pi  duL 
Now  eonsider  a  dil'ticulfv  embedding  in  in  a  Wh  ijiiesiion  (f,  lilison,  lo')|  ): 

d!'t.  I’ll  1 )  \Vlio  did  .loh:-  doii.iie  the  Inrniiiiie  ih.il  Itie  iep.!ii  mail  Ihai  die  dog  hit  louiid 
to' 

All  o  \  ei  lo.uied  spe  c  I  P  lehiiion  on  the  lei  eo  cis  sv ;  eaiinol  he  die  som v  e  ol  dii  luadf  \  m 
j\;'  U7i(,'/|  ,ind  l\7.  ./iihn\  lu'cil  no!  lie  huftered  in  die  leensei  ■.  .et  heeaiise  (he\  .iu“  aliaehed 
iininedialeh  .e  iii  1  igiii e  ">  I  Sin  pi  i  ,iii;'l\,  iiowa"-. ce  'sf.  So.  i  does  (‘ledief  didii  iills  lieie 
loi  .uiotliei  KMsoii.  i  'i  iiisidiei  the  lelalion  ol  vi!i|eei  Ir.iees  .uid  aiiteeedenls  ici  ( i 
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)  45 


(100)  Who,  did  John  donate  the  furnitufc,  that  the  repairman*,,  that  the  dog  bit  t; 
found  t,  to  t,? 

Each  of  the  traces  is  generated  by  accessing  the  aiuecedents  in  spec-CP  position  via  the 
assigners  set.  Dy  tl;e  second  relative  clause,  three  CP  nodes  must  be  inde-\ed  on  the  spec-CP 
assignors  leiation  in  order  to  generate  the  correct  traces: 

Assigners  ["  spec-CP;  [cp  who  did],\cp  that],  [cp  diat] 


Thus,  (99)  causes  difficudtv  because  c>f  an  overload  on  the  assigners  set,  rather  than  the 
receivers  set. 

Eady  and  Fodor  ( 198 ! )  discovered  that  placing  the  modified  NPs  in  post-verbal  position 
increases  the  acceptability  of  center-emnedded  relatives  ("(lOB)  from  (Gibson.  1991)): 

( 101 ;  AES)  The  cop  saw  the  man  that  the  woman  that  won  the  race  likes. 

(102;  AE6)  Tlie  cop  saw  the  man  that  the  boy  that  me  dog  bit  likes. 

( 103;  AE7)  John  donated  the  furniture  that  the  repairman  that  the  dog  bit  found  in  the 
basement  to  charity. 

Ni-Soar  predicts  that  these  structures  are  acceptable  because  once  the  man]  is  attached 
in  comnlement  position,  it  is  no  longer  indexed  in  the  receivers  .set.  In  the  case  of  the  object 
relative  (102),  the  A,'R  set  must  index  at  most  two  NPs  o:n  tlie  spec-IP  relation; 

Receive^^  spec-IP-  ^'he_boyUf^p 


Note  thal  the  creation  (-f  the  trace  in  object  position  doc.s  not  .require  ihe  antecedent  to  be 
in  the  receivers  set  The  antecedent  is  accessed  via  the  assigners  set  in  S{x:c-CP  position 

In  tlie  subject  relative  (101),  at  most  one  NP  must  be  indexed  in  the  receivers  .set 
(Iav'  the  wcrnan]).  Ihi:.  means  'lia*  ■enler-ernhccdcd  subject  relativc.s  are  predicted  to  be 
acccjitabb',  even  in  iireverbai  position,  in  contrast  to  the  difficult  object  relatives: 


t  104,  t  he  man  shat  the  vvuinan  tliai  won  the  lace  likes  eats  iish 


Inilift,  1,0*1)  or  (h*  ,'),  only  two  NPs  ( [  j  and  |  v,c  n-.'nufn  [  niiisi  be  iiu'exed  sirnulia 

iU'oiisiv  holding  lo  tiic  juuginionis  presented  ui  (<,.’nlison,  Ififi!  ),  ihis  is  ihe  tiiAi  !!it:oriect 


irod.  . imn  ol 

the  model 

However  at  ie. 

veil-  tioii  Is 

111  live  iigti! 

dM.‘;'sSio.!  1  here 

n  gonerai  th 

an  sufiieei  r 

ohinves  (lord,  1 

lii  the  c  ipn 

as  a!  0  \  o  ii'iu 

.(■  on  iioulile  eon 

oi.. ova's  SI) 

1  f !i  :i(  'v^.  int ; 

viital  Imicsumk  h 

as  l,a  as  rehitne  diffiouiiv  (.'o.'ioc! nod,  ihi-, 
\  !deiu:e  that  objeot  iclalncs  are  moie  chltu  un 
;  Ikiiriifs  A  (>  lugaii  IhOo' tiiualels, 

(■  on  iioulile  oentci  enthediiingos  uses  olijc,'!  fclaiivsv,,  luii  salnect 
tnents  tor  ,  !0-h  ui'isi  Siittii  s' 
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lliai  Jill  left 

V  NP 
bothered  Sarah 

F'IGURE  5.5:  The  wrong  way  to  arialyse  subject  sentences. 

5.2.3  Subject  sentences 

Subject  sentences  are  sentences  that  appear  as  subjects  of  other  sentences: 

(10.5;  AE8)  That  Jill  left  bothered  Sarah. 

Kimball  (1973)  noted  the  unacceptability  of  embedded  subject  sentences: 

(106;  PBS)  That  that  Jill  left  bothered  Sarah  surprised  Max. 

The  most  straightforward  analysis  of  subject  sentences  places  the  complemcntized  sentence 
(CP)  directly  in  subject  (spec-IP)  position  (Rosenbaum,  1967;  Gibson,  1991),  as  in  Fig¬ 
ure  5.5.  Under  such  an  analysis,  NL-Soar  would  predict  the  difficulty  of  (106),  sinci  three 
nodes  must  occupy  the  spec-IP  relation: 

I^CEIVERS  I  spec-IP: _ [c/>  thatUcp 

However,  Koster  (1978)  presents  compelling  evidence  that  Thi.s  analysis  is  incorrect. 
He  points  out  a  number  of  anomalies  that  arise  as  a  con.sequence,  including  the  two  below; 

( 107)  (a)  *  Although  that  the  house  is  empi  y  may  depress  you,  it  pleases  me. 

(b)  Aithougii  it  may  depress  you  that  the  hoi  se  is  enijiiy,  it  pleases  me. 

( !08)  (a)  *Did  that  John  showed  up  pie.ase  you? 

(b)  Did  John’s  showing  up  ple;tse  you'/' 
ic)  *What  does  ihat  he  will  coi  w  prove  ' 

(d)  What  does  his  corning  prove'' 

.Subjccl  .sentences  an?  generaiiy  ungianunatical  in  subcadinaic  cSau.ses  tlO/i,  and  caiu.ut 
serve  as  th  -  subject  in  subject  auxiliary  inversiorts  (108),  But  ;f  the  structure  ui  I’igu'c  5.5 
is  conccl.  these:  ceusiruction--.  sh.ould  be  aceeptabk*. 

Koster  [irovides  ^ui  aiternaave  anaivsis  that  places  the  sutueel  scuteuce  in  a  ui(i,i \dizcd 
po>,iiiori  lioimd  by  a  phcs[K!icu.’iea!lY  null  trace.  Such  souciuitss  aie  ahc.tdy  renuiretl  to 
handle  lopii'ali/asion  hi  Hnehs!)  (KciSiCr  presents  atiduioiuil  ilui.i)  cxai;ir:»si;s i: 

f  i.ii  t  de\a-[,  sfie  eei  iainis  !:>  t,. 

(to  rius  book.,  !  askeii  Bdi  fo  icari  t,. 


5.2.  PrcdicUon''  on  the  PB  und  AE  colkci’ori 


147 


IP 
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\ 

VP 

,/''v 

\ 

NP 


bothered  Sarah 


FlCiUl'^E  5.6:  Yopic?lization  analysis  of  sentence  subjects. 


Since  topicaiizacion  is  a  root  phenomenon  (i.e.,  it  generally  occurs  only  at  the  level  of  the 
main  clau.se,  not  in  embedded  contexts),  this  analysis  predicts  the  unacceptability  of  (106) 
on  grcnimatical  grounds,  because  (106)  requires  an  embedded  topicalization.  A  relative 
clause  with  an  embedded  subject  sentence  should  ,'>!so  be  unacceptabie: 

( 1 10;  rB6)  *The  woman  (hat  for  John  to  smoke  would  annoy  works  in  this  office. 

(Ill;  PB7)  *T1k:  company  hired  the  woman  that  for  John  to  smoke  would  annoy. 

Koster’s  1981  treatment  of  topicalization  involves  a  new  phrase  structure  rule.  In 
modern  GB  syntax  topicaliiation  is  simply  analysed  as  adjunction  to  I,P  (Rtxhemont  & 
Culicover,  1990).  Figure  5.6  shows  the  revised  structure  for  subject  .sentences. 

This  analysis  has  immediate  processing  imolications.  Since  the  subject  sentence  no 
longer  occupies  the  .swe-iP  position,  is  should  be  possible  to  embed  a  subject  NP  modified 
by  an  object  relative  clause  within  a  subject  senteiice: 

(!  12;  AES1)  That  the  food  diat  John  oraeied  tasted  good  plea,sed  him. 

This  is  .  orreedy  predicted  to  be  acceptable,  since  the  tnree  initial  phrases  iue  distributed 
across  two  structural  relations  in  the  A/R  set: 


adjoin  IP: 

Icr  I'hall 

spec -IP: 

!s7>  the'  food],  [/VP  John] 

Since  fronted  clauses  are  also  adjoined  to  IP,  a  fronted  clause  fo'iowed  by  a  subject  relative 
is  acceptable: 

( !  I  3;  Alilb)  While  Mary  .slept,  the  .sock  that  the  dog  chewed  fell  on  the  floor 

The  sainc  jnedietioii  bolds  for  topicaii/cd  .NPs  as  well. 

(  1  14,  ,'\i  ! !  )  Bob.  the  gii!  tha!  t.he  dog  scared  iike.s. 

Bv  iKaruiialf/sng  snbji‘et  sentences,  it  is  possible  to  embed  them  without  causmg  fiarsu'C 
hiecKci(.)vvii  {Kunha!l,  I ')?.V.  ( libsiin,  ;99i): 
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IP 


C 
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IP 
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[• 

aiigcted  the  boy 


NP  r 

/'  \  sui’prise/l  her 

NP  N’ 

Joe’s  I 
N 

leaving 


Figure  5.7;  Nominaiized  siitjcct  sentences. 
(115;  AE12)  That  Joe’s  leaving  surpiised  her  angerec*  the  boy. 


The  structure  for  (1 15)  is  shown  in  Figure  5.7.  NL  Soar  handles  such  structures  I,)ecause 
they  are  left  branching. 

Although  the  restriction  on  embedded  tooicalt'ation  appears  to  be  ’jniver.sai,  oreun 
(1976),  Hooper  and  Thompson  (1973),  and  Koster  (1978)  hiimelf  havt=;  al!  noted  diat  root 
phenorraena  are  sometimes  marginally  acceptable  in  the  complement,'.,  of  a  mstt  icted  -..less 
of  English  verbs: 


( 1 1 6)  (a)  I  know  that  John,  she  likes. 

(b)  ■*T  persuaded  him  that  B»ll,  Herb  likes. 

(c)  '■  f  hoped  that  John,  she  likes. 


Thus,  in  some  ctsses  ecibedded  sentential  .subjects  will  be  ac-ceptablc’. 

!  18;  /iEi;3)  i'hc  cop  believes  tliat  for  tiro  bv.>y  to  stnoke  is  bad. 

( ! !  9;  AFi  4)  Mary  held  the  belief  that  for  Johii  to  srnokt.  would  be  annoying. 


Ailhough  these  corrsti  uctions  may  be  grammiti'cally  rnargirtai,  th;'  questioi!  for  Ni ,  Uo-i'' 
IS  w!(,“tlK'r  such  structures  can  he  processed,  'i’hey  can  in  fact  ;;»e  p-rocessc-el;  rso  rei;ui<jri 
(oos!  hidex  more  ihai;  two  nodes.  Figure  ;')-8  gives  the  stiuctur-T.  preaiuced  (V; :  il  18, 

The  tenseU  vi  rsiuns  nt  ihese  ^iibit'cs  .sc:nc,'ie*;s  apprai  iu  in,  iai  (ess  accepCibk*' 

1  (7)  (a;  ’*]  beneve  t.hai  rhai  John  sinakcs  yvinoys  rne. 

( b)  '*  Mai  y  lieid  ilur  Injial  thai  ihai  Joini  .SiiiOke.s  i'  anru.iynig 

it  NCi  rns  tliai  there  is  shijk;  nroblern  it'later!  ttj  the  tept  art'd  ihai'.'.  which  is  u'drpc.Mdfii’.  any  i4  the  sss  le; 
iittn  'iited  iu'ii',  but  it  is  cnincis  niK  leal'  wnnslte''  the  niohietn  ts  a  pionessinp  issue  oi  a  gtans  tiSM:  al  *ssoc 
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NP 
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VP 
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CP 

bdi'.;v?,s 

1 

1 

1 

C’ 

c 

that 


\ 
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\ 
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FiGUPJb  5.8;  Acceptable  eir bedded  subject  sentences. 


ITiis  decs  ^ead  i:o  a  problematic  prediction,  a  subject  sentence  embedded  in  a  comple  ¬ 
ment  of  a  !.;.:ijcct  NP  shor.ld  be  acceptable,  in  ^‘act  iv  ooes  seem  to  cause  some  difficulty 
(Gibson,  1991): 

(120;  PBX)  ‘^Ma'y’s  belief  that  for  .lohn  to  snokc  would  be  annoying  is  apparent  due  to 
her  e-xpre?sion. 

cen  piocess  this  precisely  because  the  sabject  sentence  can  be  indexed  via  the 
adjom--IP  re.!atjon; 

;  adjcin-.lP:  ico/'-rl 

[_  spec-IF'  i  V.”  hc/(e/f  Jo/inj 

however,  tlie  contrast  oetween  (120)  and  (1 19)  is  not  particularly  .m  iking.  The  marginal 
granin  aticil  .status  of  euibcdded  subject  sentences  in  generai  iriakes  it  a  soniewhar  ditfh.u!t 
to  '■  v^i„a;e  .•essoeg  tatui/cs  against  these  constructions. 


y..2.A  Ccn}jsiej.*:^Lt5  ai  r«r>n)inals 

f'sirg  nouiis  inai  take  senfeni'ai  couinkunents  (dir  hrlirf  that,  the  noasihslitv  (har,,  n  ss 
ocssible  ty>  (.oostrucl'oiiN  that  icqu're  bnlicring  Un'tc  or  more  subject  I'li’s  without 

in'if.g  any  iCianvc  clause  inoriii;c,i!io;i,  NL,  St.ar  concctlv  predicts  the  diflicalty  on  such 
cmbcoding* : 


mv ; 


.M.itUi’s  s'.isp.irKii’i  that  a  ruoK)!  in.ti  the  ci-c'oluiit  ii.ui  no!  f'lcen  iiin  l.ui 
true  ii V os .li  linn  io  uoes'i'.’atc  iorlitci 
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Receivers  |  spec-  IP  suspicion],  l/<,rp  rumor],  l^p  c/ection] 


The  same  prediclior.  holds  for  certain  mixes  of  complement  clauses  rnd  relative  clauses; 

{ 122;  PBIO)  The  man  who  the  possibility  that  students  are  dangerous  frightens  is  nice. 

By  using  a  subject  relative,  it  is  po,ssible  to  create  an  acceptable  embedding  of  a  relative 
clause  w'ithin  a  complement  cause: 

(123;  AEI5)  The  thought  that  the  cop  that  hit  the  boy  was  rich  angered  her. 

NL-Soar  handles  this  because  thought  and  cop  are  the  only  NPs  that  must  be  indexed  on 
the  spec-IP  relation.  Figure  5.9  shows  the  phrase  structure. 


Receivers 


spec-IP: 


[/VP  the  thought],  [/vp  the  cop] 


An  object  relative  embedded  within  an  NP  complement  should  also  cause  breakdown,  but 
the  result  is  considerably  more  acceptable  than  (122): 

(124;  AE16)  The  thought  that  the  man  that  John  liked  screamed  scared  me. 


This  is  an  incorrect  prediction  by  NL-Soar,  but,  as  in  the  earlier  case,  it  does  at  least  correctly 
predict  that  object  relatives  are  more  difficult  than  subject  relatives. 

NL-  Soar  correctly  predicts  that  NPs  with  sentential  complements  may  appear  m  subject- 
auxiliary  inversions: 


(125;  AEE?)  Who  did  the  information  that  Iraq  invaded  Kuwait  affect  most? 
However,  NL-Soar  also  predicts  that  another  embedded  clau,se  should  be  acceptable: 


(126;  PBi  1)  Who  does  the  information  that  the  weapons  that  the  government  built  don’t 
work  properly  affect? 


This  incorrect  prediction  arises  because  does  projects  both  CP  and  IP  phrases,  so  that 
information  is  attached  in  s[)ec-fP  position  as  soon  it  is  encountered  (see  again  Figure  5.4). 
'riius,  only  weapons  and  government  must  be  indexed  in  the  A/R  set  simuitaneously. 

As  with  the  case  for  embedded  relatives,  moving  the  complement  embeddings  to  post¬ 
verbal  position  inc! cases  their  acceptability: 


(127;A1.iS)  riic  pentagon  employs  many  hiueauciats  who  the  information  that  Iraq 
invaded  Kuwait  alTectcd. 

\  1?H;  Al  l'M  Ihc  |)ii)lcssui  did  not  Ixdicve  my  claim  dial  the  rcpoit  ihat  the  school  was 
coiru[)t  wa'-  huc.cd. 


I'oi  example,  m  t  12/i,  onh'  two  NPs  must  tie  i  dexed  in  spec  If'  sntee  hnream  rats  is 
leiians'd  t  roni  tiie  id-  eivei sel  once  il  iies  upies  the  ruinplemenl  position. 


Ki  (  1  IS  I  Ks 


spis  li’ 


|\/.  inpirni(!lu!n\  I  v.;-  /oe/| 
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IP 


N  CP 

thought  I 
C’ 


hit  the  boy 

Figure  5.9:  Embedded  complements  of  nominals. 


5.2.5  Clefts 

The  cleft  constiuction  in  English  is  a  particular  kind  of  predicate  complement  construction 
that  .serves  to  focus  part  of  the  sentence.  There  are  two  types.  The  cleft  has  it  as  its  subject 
and  something  like  a  relative  clause  at  the  end  (129b): 

( 129)  (a)  The  man  saw  a  dug. 

(b)  It  was  a  dog  that  the  man  saw. 

The  pseudo-cleft  has  something  like  a  Wh-clause  in  subject  position: 

( 130)  What  the  man  saw  was  a  dog. 

Ni,-.Soa!  correctly  predicts  that  twu  einlx'dded  relative  clauses  in  the  complement  NF  uf  a 
cieli  stioui'l  be  acceptable: 

(  I  3  1 ;  A(;:'())  It  c.  as  a  dog  dial  the  m.in  Ih.it  tlie  t  up  admired  saw. 

In  (  1 I  !  the  eoiiiplcmeiit  NF  i/ui'  need  nut  he  held  Mmuiianeoiisiy  with  the  iwt.>  .‘'i.ii)|ei;:ts 
ot  ilic  emix'dded  clauses  [muu,  (  Op).  The  same  predictioi)  liokis  toi  clefts  with  etTibfd'.icd 
cnm'dt.mK.  Ills  of  NFs: 
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I  was  Ihe  dog 
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NP  I' 

the  man  /  \ 
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V’ 

/  \ 

V 

saw 

Figure  5.10:  a  pseudo-cleft. 

( 1 32;  AE21)  It  was  the  cop  that  the  information  that  the  dog  bit  the  boy  influenced. 

Adding  one  more  embedded  clause  in  results  in  parsing  breakdown; 

(133;  PB12)  It  is  the  enemy’s  defense  strategy  that  the  information  that  the  weapons  that 
the  government  built  didn’t  work  properly  affected. 


Receivers 


spec-lP: 


[f^P  information],  |,v/'  weapons],  l^p  government] 


A  clefted  .subject  sentence  is  unacceptable  for  grammatical  r  easons,  since  the  topicali/.ed 
subject  sentence  cannot  appear  in  such  embedded  contexts; 

( 134;  PB13)  *It  is  the  enemy’s  strategy  that  for  the  weapons  to  work  would  affect. 

The  initial  \Vh-clau;se  in  p.seudu-  clefts  is  analysed  as  a  headless  relative  clau.se.  Fig^ 
me  5.10  shows  the  structure  for  (130).  The  interaction  of  this  structure  with  the  A/R  .set 
leads  to  some  interesthig  predictions.  Because  the  initial  Wh-word  does  not  occupy  spec-IF 
posifion.  It  should  he  possible  to  embed  an  additional  relative  clause  within  the  tieadless 
relative  without  causing  difficulty: 


(  1.15;  .\t:22)  What  the  wdiikhi  that  .lohn  marned  likes  is  smokctl  salmon 


iST  .So;!!  handles  this  hecaiisc  only  two  .NPs 


need  to  he  simultancuiisl v  indcKt  i]  .>n  oiu: 


iclaiinn: 
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RECKIVERS  spec-IP;  (/v7'  woman],  \jsip~Johni 

A  .similar  prediction  holds  for  an  embedded  sentential  complement; 

(136;  AE2.t)  W'.iat  the  rumor  th.at  the  accused  man  had  robbed  a  batik  induenced  Vv'as  the 
judge’s  decision. 

As  predicted,  one  additional  embedded  clause  does  lead  to  breakdown; 

(137;  PB 14)  'V\''hat  the  information  that  the  weapons  that  the  government  built  didn’t  work 
properly  affected  was  the  enemy’s  defense  strategy. 

Sentence  subjects  in  pseudo-clefts  are  ruled  unacceptable  for  the  grammatical  reasons 
discussed  above; 

(138;  PB  15)  What  for  the  weapons  to  work  properly  would  affect  is  the  enemy’s  strategy. 

5,2.6  Though-preposing 

Though-preposing  is  another  kind  of  focusing  construction  which  is  used  to  front  predicate 
compleme.nts; 

(139)  Intelligent  though  the  man  is,  he  has  no  sense  of  humor. 

The  phrase  structure  is  given  in  Figure  5.1 1.  The  moved  AP  occupies  spec-CP  of  the  CP 
headed  by  though.  Thus,  the  fronted  con.stituent  does  not  interfere  with  buffering  NP ;  on 
the  spec-IP  relation,  predicting  the  acceptability  of  an  embedded  relative  clause  (Gibson, 
1991); 

( 140;  AH24)  Intelligent  i  ough  the  man  that  Ellen  married  is  . . . 

kbCElvi.RS  I  spec-IP:  j,vg  .'/ia/i].  Iwr  hAlcn] 

Or  an  embedded  sentential  complement; 

( 141;  Shocking  tliough  the  news  that  iratj  invaded  Kuwait  was 

.Adding  one  additional  embedded  t  lause  leaiis  to  breakdown,  as  [ucdictcil: 

(l•12;  Pit|(i)  .Snipnsing  though  the  iiiloniiaiiDn  that  tin'  weapons  llu'  go'.. c,  niiKMii 
hjilt  didn’t  vvoik  propeily  was  .  . 

rhongli  pie|)osmg  with  a  sentence  sni>)cet  's  iimu.'icf'table  loi  the  gianinniiical  rea'-ons 
t]isenss''d  abi ivc: 

(  11  g  I’l;  I  ; )  'Siirpi  iMii;'  lln.iiigh  tor  ihe  'Arsipons  in  'Asuk  propei  !\  .  uiid  he  . 
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Figure  5. 1 1 :  Though-pveposing. 


5.2.7  Pied-piping 

Pied-piping  is  a  right-branching  structure  that  can  be  used  to  avoid  stranding  prepositions: 

(144)  (a)  *l’he  table  wl  u'h  the  man  put  the  block  which  the  cop  put  tiie  dog  on 

on  is  big. 

(b;  AE26)  The  table  on  which  the  man  put  the  block  on  which  the  cop  put 
the  dog  is  big. 

The  phrase  .structuie  for  (144b)  is  given  in  Figure  5.12.  As  Pickering  and  Barry  (1991) 
point  out,  pied-piping  presents  a  challenge  for  ptorsers  that  must  wait  to  posit  the  object 
truce.s  until  dieir  surface  positions: 


•  I  U  !  The  table  on,  which  the  man  pul  the  block  oig  whi'di  the  cop  put  the  dog  T 
t,  is  big. 


If  NT,  Soar  was  foiced  to  wait  to  [losit  traces,  then  pied  pipi,'<g  would  f  iCcoiue  (inacrept 
able.  The  coniplenient  relata)!i  in  the  a.ssigners  .set  would  eventually  drop  veii>s,  preventing 
the  generation  of  the  traces,  llowcvc!',  there  is  no  reason  Nl.  Soar  niu.st  w.ut  for  tlic  surface 
poMiioii  to  posit  tlic  trace.  As  di.scus.sed  in  S.Te..^,  t!;e  trace  is  gcneiaied  as  soon  a--,  die 
aritceedenl  and  the  .sirucliiiai  assigner  are  available  I'hus,  .^''1.  Som a. o.r•ecslv  jireviicts  ti  .u, 
like  f  iglii-biaiicinng,  pied  [iiping  may  he  eonliiiued  indehmiel y: 

(  I  K)i  \\(  saw  die  the  table  oi!  which,  the  man  put  ilic  tdo'.  k  >ii  ''.hu  i  die  ,  <'(>  pui 
die  do;.’  on  winch  (tie  boy  jnit  die  eoilm 
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f'K  ICKI  bid:  i  ‘h'iI  (iipM;' 
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1 .  Independence  of  ambiguity 

2.  Insufficiency  of  embedding  depth  alone  to  cause  parsing  breakdown 

3.  Fairly  sharp  drop  in  acceptability  at  two  center -embeddings 

4.  independence  of  length 

5.  Independence  of  short-term  item  memory 

6.  Little  effect  of  explicit  instruction  and  training 

7.  Some  effect  of  semantic  content 

Figure  5. !  3:  Qualitative  phenomena  of  parsing  breakdown  (from  Chapter  2). 


5.3  Accounting  for  the  major  qualitative  phenomena 

Chapter  2  presented  a  list  of  seven  qualitative  phenomena  surrounding  pitrsiiig  breakdown, 
summarized  in  Figure  5.13.  Most  of  these  phenomena  arc  accounted  for  automatically  as 
a  result  of  NL-Soar’s  performance  on  the  AE/PB  collection;  others  requue  some  additional 
plausible  assumptions  that  are  consistent  with  the  model. 

Independence  of  ambiguity.  Parsing  breakdown  can  happen  in  NL-Soar  independently 
of  ambiguities.  The  contents  of  the  Ay'R  set  arc  not  strictly  a  function  of  how  ambigiuties 
are  resolved,  t)ut  a  function  of  the  interfering  properties  of  the  incoming  syntactic  structure. 
In  the  classic  difficult  center-embedding,  all  of  the  subject  NPs  are  not  available  because 
the  A/R  set  cannot  index  them  all  simultaneously.  Ambiguity  is  not  an  issuef 

Insufficien<:\  of  embedding  depth  alone  to  cause  breakdown.  The  A/R  .set  ;:)Iace,s 
no  limits  on  the  amount  of  phrasal  embedding.  The  limit  is  strictly  in  the  bultering  of 
identically-mdexed  phra.ses.  This  may  or  may  not  lead  to  limits  on  einbediling,  dep-enduig 
on  the  syntactic  structure,  in  particular,  llicre  is  no  limit  on  riglit  or  left  biaactnug,  while 
there  arc  .severe  limits  on  centei  cmbeilding. 

iiiiriv  .siuop  tlrap  in  acceptaiii/ilv  oj  <  met  t-mhctjilings.  1  iic  cnipiricai  evidence 
pit’scnicd  IP  ('h.ipter  2  iiiade  cleat  tioiii  a  range  of  incasiire,^  ih.it  douMy  cciitci  enibiskicd 
ol'jcct  relatives  were  unaeeeplabie,  while  singly  embedded  ohjeet  leiaiives  aie  acce[rtable, 
Nl.  .'.ir  predicts  this  sliarp  tirop  m  .icce|)tal'ilil v  simply  iiecaiisc  it  i:.  the  thiic!  NP  m  she 
doiihiv  cmbcddeii  I'oiisuiiclion  tli.it  exceeds  the  A/R  wi  e.qiaeity 

hhlept'/idfiict  at  Ifngth.  I  tie  liimtations  ol  the  .-X/k  set  ,irc  a  liiiiciioii  of  die  '.\  iil  .,tic 
stun  lute  of  die  iiijim  iiol  lis  length  \Vc  s.iv\'  .ibove  that  siii'il  scuitcnces  (c.s'  .  I’h.'i  krai  icaii 
!i>  l>ii  !kiio\\  n,  \s  !iilc  long  seiilcnccN  au- .icccpl.il)le  ic  g  .  x!  /id  in  gkuici  al,  Nl  Soar  [>!.uvs 
th'  Inim  on  (he  icnglh  o!  .iceepl.ihle  si-iiieiu’cs. 

htJ-jH  luitfhi'  t’t  shi'fi  iittii  iii'iii  iiKriii  irv  I’aisnig  I'icakdou  ii  i  an  oci  in  c\cii  dii'io.'ti 
sni  ip  k  In  .in'  ,,l  he  lo  u'k  al  i  .ill  liu’  ileiiis  in  I  he  sciiteiKC  I  L.ii  k  m  .V  I  hn  ns,  )  o  i  11  h-  ,i  h  .i  i 
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term  memory  accessed  in  this  case  is  very  likely  to  be  a  phonological  memory  (Baddeley, 
1990).  The  limitations  imposed  on  the  AyR  set  have  nothing  to  do  with  any  limitations 
of  the  phonological  buffer.  Thus,  it  is  possible  to  find  short  .sentences  which  overload  the 
syntactic  processing  without  overloading  the  phonological  buffer.  This  is  precisely  what 
happens  in  the  case  of  short  center-embeddings;  the  items  can  be  recalled,  but  cannot  be 
parsed. 

Little  effect  of  explicit  instruction  and  training.  The  structure  of  the  A/R  set  is  an 
architectural  hypothesis;  it  cannot  be  modulated  by  knov/ledge,  This.  NL-Soar  predicts 
that  the  limitations  that  arise  from  the  structure  of  the  A/R  set  cannot  h;  overcome  without 
using  some  auxiliary  memories.  One  such  possible  auxiliary  memory  is  paper  and  pencil— 
eventually  very  deep  embeddings  can  be  worked  out  with  enough  time  and  patience.  But 
there  is  another  more  interesting  possibility:  the  phonological  buffer.  If,  as  argued  above, 
the  phonological  buf'ercan  maintain  a  short  double  centei  embedding,  then  with  sufficient 
practice  it  may  be  possible  to  deliberately  work  out  the  correct  pairings  of  the  words  based  on 
this  memory  alone.  In  fact,  the  subjects  of  Blauberg  &  Braine  (1974)  were  able  to  increase 
their  ability  to  comprehend  cenfer-embeddings  by  one  level — that  it,  they  eventually  learned 
to  comprehend  double  embeddings,  but  not  triple  embeddings.  The  plausible  explanation  of 
this  result  is  that  the  triple  embeddings  exceeded  the  short-term  capacity  of  the  phonological 
buffer,  so  the  newly  learned  pairing  .skill  could  not  be  applied.  Note  that  working  out  the 
correct  pairing  is  all  that  i.e  required  to  succeed  on  the  comprehension  test;  it  is  not  at  all 
clear  that  the  subjects  al.so  learned  to  perceive  the  structure  as  grammatical  (Marks,  1968). 

Effect  of  semantic  content.  Comprehension  of  .semantically  supported  center  embeddings 
is  better  than  .semantically  neutral  center-embeddings  (.Slolz,  1967).  Nl.  Soar  cannot  ac 
count  tor  thi;';  result  witliin  the  coniines  of  the  A/R  set  for  the  reasons  mentioned  above. 

1  lowevet.  all  that  is  needed  to  siaceed  in  these  comprehension  tests  is  some  ni'nii  ry  ot  the 
Items  m  the  sentence,  C(nn!>ine<t  with  general  knowledge  that  (lermits  plausible  iclations 
to  be  cstahiistieil  between  the  ile’  is.  isir  siioit  sentences  id  the  kind  used  in  tlie  studies, 
tile  phonological  short  leiin  memoiv  may  saflice,  or  [vrhairs  llie  partial  situation  model 
constructed  Irom  the  comprehensK>n  ol  ttie  nouns  aiiii  verlrs  In  eilhei  case,  seiu.uitK 
knowledge  l  onid  I'c  In ouj.d't  to  be,ir  lu  produce  a  [daiisilslc  p.ni  u.g  ot  items  w  ithout  the  ,-\/l* 
set  (f'as'ing  a  sign.lic.mt  lule. 
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Chapter  5.  Parsing  Breakdown  and  Acceptable  Embeddings 


Table  5. 1 :  Summary  of  predictions  on  AEyPB  collection. 


Right  and  left  branching 

°  AE2  • 


Center-embedded  relatives 

AE3 

AE4 

AE5 

AE6 

AE7 

PBl 

PE  2 

PB3 

PB4 

« 

• 

• 

0 

AES 

• 

AE9 

• 

PB5 

AElO 

• 

PB6 

• 

Subject  sentences  and  topicalizations 

AEll 

• 

PB7 

«> 

AE12 

• 

PBS 

0 

AEi3 

• 

.A,E14 

• 

AE15 

0 

PB9 

• 

AE16 

« 

Complements  of  norninals 

AE17 

• 

PBIO 

» 

PB 1 1 

0 

AE18 

• 

Ah  19 

• 

AE20 

• 

PB 1 2 

• 

AE2 1 

• 

PB )  3 

Clelts 

AIM  2 

• 

PBl  4 

« 

AlM.i 

• 

PB  1 ,3 

• 

AH24 

• 

PBIb 

• 

1  hougli-pieposing 

AE2."' 

• 

PBl  7 

• 

I’led  piping 

AlM’h 

• 

•  coiifi  l  prediclion 
O  uiconecl  pretluliun 
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:s  wuMli  e.-mnieniiii!.’  Iieie  on  tlie  idle  ol  sviUaelie  t!ieoi\'  iii  .tie  model.  Lhe  model  ean 
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(1988),  with  i^s  ontology  of  41  surface  grammatical  relations  for  English,  may  very  well 
increase  discriminability  enough  to  change  some  predictions  of  break.dov«n  to  predictions 
of  acceptability 

While  it  would  be  worthwhile  exploring  alternative  syntactic  theories,  such  a  task  is  a 
substantial  undertaking.  Changing  the  underlying  syntax  will  have  potential  ramifications 
in  the  predictions  for  garden  path  effects  (Chapter  6)  and  ambiguity  resolution  (Chapter  4) 
as  well  as  parsing  breakdown.  At  this  point,  the  most  that  can  be  concluded  is  that  the 
Government  and  Binding  .structures  do  a  good  job  of  helping  inakc  the  right  predictions, 
and  it  is  not  clear  that  any  alternative  syntactic  theory  would  do  significantly  better.  In 
fact,  we  have  seen  several  cases  (such  as  subject  sentences)  where  the  precise  analyses  of 
GB  lead  to  rather  interesting  predictions  that  would  not  necessarily  be  captured  by  other 
approaches. 

Perhaps  the  most  significatit  open  issue  for  the  'heory  is  establishing  the  nature  of  the 
strategies  fot  resolving  condicts  in  the  A'R  .set.  Tfie  is.sue  was  effectively  avoided  here  by 
using  the  theory  in  a  way  that  abstracts  a’way  from  the  effects  of  particular  strategies.  But 
the  questions  remain;  Wfiich  strattgies  are  the  right  ones?  Are  tiie  strategies  learned,  or 
architectural?  The  question  :s  an  in 'csting  one:  vve  saw  in  Chapter  4  that  the  obvious 
alternative  fo,  an  architectural  strategy  ypure  recency)  is  unlikely  to  be  correct. 

Beyond  the  few  nfissed  predictions  and  the  origin  of  the  A/R  set  strategies,  the  biggest 
issue  for  further  developmenl  of  the  theory  is  empirical  data.  The  collection  used  here 
is  primarily  ba.«ed  on  informal  linguistic  acceptability  judgments,  and  some  of  the  judg¬ 
ments  involve  somewhat  questionable  borderline  cases.  Unfortunately,  the  psycholinguis- 
tic  evidence  corxerning  parsing  breakdown  is  almost  exclusively  concerned  with  center- 
embedded  ohject  relatives.  Bcfime  attempting  to  modify  the  theory  to  increase  the  empirical 
coverage,  it  seems  worthw'hilc  to  develop  a  better  empirical  database. 

Ill  the  next  chaptc.,  wc  consider  parsing  difficulty  that  is  cau.sed  by  local  ambiguity, 
r  !her  than  deep  cnihcvlding. 


Chapter  6 


Unproblematic 


A  i-THOUGH  Garden  path  eitects  have  played  a  significant  role  in  psycholinguistics 
/-%  over  the  past  two  decades,  the  appeai  ance  of  detailed  theories  of  a  wide  range  of 
.JL  ^  bfith  garden  paths  (GP)  and  unpiobletnatic  ambiguities  (UPA)  is  relatively  recent, 
staning  with  Pritchett’s  1987  tf:esis.  Tliis  chapter  presents  NL-Soai  ’s  account  of  GP  and 
UPA  phenornenv^.  The  first  section  describes  the  NL-Soar  garden  path  theoiy,  derived 
directly  from  the  structure  of  the  model  presented  in  Chapter  3.  'fhen  the  theory  is  applied 
in  detail  to  the  oT-item  collection  of  garden  paths  and  unproblematic  ambiguities.  'Fhe  third 
section  shows  how  NL-Soar  accounts  for  the  major  qualitative  garden  path  phenomena. 
The  chapter  concludes  with  a  discussion  and  summary  of  the  results. 


6.1  I'lie  NL-Soar  theory  of  garden  path  effects 


rhe  NL-Soar  garden  path  theory  can  be  summaiized  as  follows.  Comprehension  is  essen¬ 
tially  a  single  path  prtx'ess,  with  a  limited  capability  to  rt'cognitionally  repair  inconsistencie;i 
that  may  a.nse  in  the  syntactic  siruclure  when  the  wiong  path  is  uiken.  When  lecogmtional 
repair  fails,  a  garden  path  eflect  cxcurs 

Ni_-Soar’,s  repair  mechanism  is  simple  destruaive  repair  i  §3,3  3),  'ILe  mechanism 
cops»s!s  of  the  primitive  imeiance  cunMructors  plus  the  snip  operator  A  repair 

tiapfxms  when  snip  fireaks  an  e.'Cistirig  relation  in  the  utterance  model,  and  the  ut'erance 
mode!  is  reconstructed  with  the  standaru  link  o{X‘rato''s  Ihiough  Soar's  ciiunl.ing,  this 
rerian  p»;x:css  I'leconK's  a  seamless  pan  of  the  iccognitiona!  repenoire  of  c<.inipnrhension 
opciatvM s  (Ci  3.3 ), 

Snip  responds  to  iru:oiisisieucies  tietected  in  the  utterance  mndeS  I  lte  !ncoi!Sis*e!V','!es 
tiiat  ecu  ansit  .;kc: 


!  's i.K  Omen r .  to  ssintfietifist  svutisciiC 


aiTu:  icxi,a;  toki 


sue,. 


o!  muUipic  node-,  to  tf-e  same  ^t^ucl!Jv;.d  posUu 
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3.  Missin  ’atory  structure. 

In  case  1,  snip  ('j.;  rator.s  are  generated  to  remove  relations  to  one  of  the  competing  iexicat 
senses,  in  cases  2  and  3,  snip  C[)erators  are  generated  to  remove  relations  locat  'o  tlie 
detected  inconsistency,  where  ioca'  hs  defined  to  .mean  Aithm  tiie  cont'iir..ing  maximal 
projection.  We  will  see  examples  of  all  these  cases  shortly. 

This  kind  of  repair  can  also  be  characterised  as  cuc-bnsed  certain  structural  cues 
trigger  the  repair  process.  This  tightly  constrained  generation  of  potential  sn.'p  sits  ensures 
computational  efficiency  in  both  problem  space  search  and  knowledge  search  (match  m  tlie 
recognition  memory). 

What  does  it  mean  for  NL-Soar  to  experience  a  garden  path?  dny  garden  path  theory 
must  ultimatelj^  be  a  theory  of  conscious  proce.ssmg  difficulty  that  manifests  itself  as  an  im¬ 
pression  of  ungrammaticall  y.  However,  the  Soar  <»rcijitccUire  does  not  make  commitmerts 
about  what  is  in  conscious  evran  ness  (Newell,  1990).  It  is  dnerefore  necessary  to  make  an 
additional  assumption  about  what  ^ives  rise  to  conscious  processing  aitheuliy  in  o.tuei  to 
maixe  predictions  abv  ui  gartten  path  sentences: 


(14/)  The  NLSoar  garden  paih  assumption:  A  garden  path  effect  af(.‘‘es  in  Hi.- 
Soar  when  ii  cannot  re'-ognitionally  repair  an  inconi:»sten!  luterance  model. 

This  an  obvious  and  s^raighdorwai'd  assumption,  but  it  is  irnportaat  to  be  explicit  about  it, 
since  without  it  no  gi.rden  path  predictions  can  Int  made. 

Given  this  assumption,  ttiere  eve  three  po.s.sible.  ways  that  garden  path  effects  might 
emerge  in  NL-Soar; 

1 .  The  appropriate  s'l  ucturai  cues  are  not  available  to  trigger  the  repair. 

2.  The  syntactic  relalioms  that  must  be  altered  (snipped)  are  no  (ouger  avaijab-'e  m  the 
A/R  set. 
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6. 1  Ti'c  NL-So'-ir  theory  of  ^’yjoicr  piUh  effects 
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Figure  6. 1 :  Repairing  an  unproblematic  subject/object  ambiguity. 


.A.ny  impression  of  ungrammatical ity  that  arises,  no  matter  how  rapidly  it  is  recovered  from, 
is  interpreted  as  a  failure  ot  recognitional  comprehension,  and  thus  a  garden  path  effect. 

As  ati  example  of  how  the  theory  predicts  a  garden  path,  consider  the  contrast  between 
the  following  ooject/subject  ambiguities:  (148)  is  perfectly  acceptable,  while  (149)  causes 
a  garden  path  effect  (Frazier  &  Rayner,  1982).  Recali  that  all  unproblematic  ambiguities 
are  represented  by  a  pair  of  sentences,  neither  of  which  causes  any  difficulty'. 

(148;  (.iPAi)  (a)  Thad  !:nows  Shaq. 

(b)  Thad  knows  Shaq  is  tall. 

( !4‘);  GP!  j  Sifice  .lay  always  jogs  a  mile  seems  like  a  short  Ji.stance  to  him. 

Figiireb.l  review.s  the  repair fc-r(  !48,\  which  vrus  Itrst  de.scribed  in  §3.3.3.  Shaq 
i.^  inilialb'  attachea  in  coniplement  position  ol'A:mm'.v.  When  is  arrives,  it  is  proicctcd  to  a  CP 
and  attached  m  compleaiefU.  position.  .siMce  knows  can  take  a  sentcfitial  complement.  Now 
there  is  an  nico'caslciK  y  ni  (he  phrase  siiucture:  two  iodes  occupying  tlie  same  strucUiial 
po.sition.  snip  i.s  geaerated  to  break  a  sirucuiral  reiatiou  local  to  the  iticonst.steiicy:  the 
con'pknie.rit  iflation  between  1  v-  knows]  and  l.v;- .SVinq j.  Next,  \,^’/•  Shaq]  is  attached  iii  its 
projier  iiiial  location  as  die  subject  of  (/i  /v|.  The  hoxeu  nodes  m  the  Figure  6.1  identify 
the  ma\iiii;'l  projes  tioi!  in  which  rlie  inconsistoiicy  istielcctcd  In  (dher  words,  these  nodes 
deitnutthe  .-.cofK' of  consideration  loi  gc.<K,'-iaSing  snip  operators.  Only  relations  that  invidve 
o/ic  )t  shcivC  t’lodc.s  are  cnn.ai.ieii  (i  ior  snipping. 

in  (  i49),  ii  itiile  'S  taken  i:ntia>u^  as  the  coniiticinent  of  /oev,  [ns!  as  m  t  MS),  Because 
/C'Cv  (iC'f  s  not  take  sefuenoid  t  ornj.knnctils,  li.e  iiiniai  phiasi-  Sirwe  Jox  joy.s  is  ad|Oinet!  to 
’hi.*  incc'/caig  '.fi'ins.  hi  taid,  lin.i  is  it',  correct  tinai  pctsilum.  Ilc’ws'vcr,  [/n  vccm.vj  is  sltl' 
fmssii.g  !i.-.  suvijcct.  Bi,;  in  dus  ruse  a  snip  opcivitm  is  nca  gcnci aico  !oi  tiic  i. i>i!g>u'nient 
rcl.ns.iit  ofSw’Ci'n  ;.a,c.i|  and  h,-/'  o  •’/nlcl  because  (he  relatn'ii  is  not  ioca!  to  th"  i led 

ti.sl  ,i  s  IS''))  ,si::  (ir;,'icr  &  k,.v'K-i  i'Se)  lt>i  di  tis  >i  i:i  ,a  w 


ti"  cs  like  I  !  4'!  '  ,1;  s.- 
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V  NP 


jogs  a  mile 

Figure  6.2;  Failure  to  repair  a  subject/object  ambiguity. 

incon.sistency  (the  missing  obligatory  subject).  This  situation  is  shown  in  Figure  6.2,  with 
the  boxed  nodes  again  representing  the  locality  of  the  inconsistency.  As  a  result,  [/v/>  a  mile] 
cannot  be  reanaly.sed  as  the  subject  of  [//>  seems],  and  the  necessary  repair  fails^. 

This  garden  path  tlieory  has  a  number  of  distinguishing  features.  First,  the  theory  is 
formulated  independently  of  what  guides  the  initial  choice  at  the  ambiguity  point.  Thus,  the 
theory  classifies  structure  types  as  potential  garden  paths,  not  definite  garden  paths.  This 
chapter  is  concerned  purely  with  the  repair  process,  which  determines,  given  a  particular 
structural  interpretation  that  is  inconsistent  with  incoming  input,  whether  or  not  that  structure 
can  be  repaired.  Whether  a  local  ambiguity  gives  rise  to  a  garden  path  effect  in  any  particular 
context  is  a  function  both  of  the  ambiguity  resolution  process  itself,  and  the  efficacy  of  the 
repair.  This  chapter  is  concerned  only  with  the  latter  process;  Chapter  4  is  devoted  to 
NL-Soar ’s  theory  of  ambiguity  resolution. 

Second,  the  theory  is  a  functional  theory,  in  that  it  posits  mechanisms  to  efficiently  carry 
out  the  functions  of  comprehension.  NL-Soar  is  not  a  metric  that  distinguishes  garden  paths 
from  non  garden  paths.  It  classifies  certain  sentences  as  (potential)  garden  paths  because  it 
may  fail  to  recogiiitionally  pai.se  lho.se  .sentences. 

Third,  the  theory  embodies  Uie  oinu  tinul  Garden  Path  Hypothesis  t.Sd),  which  stales 
that  GP  effects  are  a  function  of  differences  between  the  syntactic  stnicturc  of  the  preferred 
interpretation,  and  die  .syntactic  slructure  of  the  correct  interpretation.  NL  Soar  embodies 
this  hypothesis  because  GP  effects  arise  from  limitations  in  a  repair  process  that  maps  the 
syntaclic  strueture  oi  rine  interpretation  into  the  syntactic  .structure  of  another  interpretation. 

P'inaliy,  the  mode'  works  without  repraeessitig  the  input  Recogiiitional  repair  happ  ns 
rapidly  and  rather  fns,  ieniiy  widiou'  (he  iieetl  lo  reread  or  rehear  the  mpu'. 

■W’.iiiit'l  .iii.d  1  11. IS  .  (  1 ‘iSV  I  .■clii.-illv  disi fivt'iod  .in  uistaiiec:  ol  a  slum  versmn  i.(  (  140)  iliat  laosl  nt 
Mii'n'i  Iiiuial  art  ept.ibk-,  .is  iin'a.siaco  bv  a  j;i aminalis aiits  jiiilyni'.-nt  task 

!  I  ’4),  (  I  A.’  I  ia)  V's'licii  rile  bos  s  .frik;'  ttie  ;lus!  kills 

..bi  Wlirii  the-  bi)\  ssmk“  the  iKn"  rlu*  i  ai  OMisauae 

NS  ‘‘u  ,11  sill  1  [1,  (sill  Is  tills  lo  be  .!  (  il’  i  )iK'  jj.-'ssiblc  cxpi.inalioii  ct  this  irsult  is  iha!  ihf  ii'ir  .insil  i '  c  use  .a  i !/,'  ■ 
'iiR'in t.t  I'lMk-s  hr  I  -.a  ii  \  ri  s  o!  the  I  la  f  aiisiti ..  c  use  <  >t  e',  Ii't  h  is  requii  n!  !oi  (he  i.  oi  irrl  ar.iA  sis  It 
1  ,  :  (o!  i  !r  Li  ....  ill  ir  ' ! c  h  s  ir,  oi  sa,  |i  .in  isOri  I  sl  ,  iiil.i  I'c  m  Ml  Soai  I  oi  iioss ,  a  i-,  sini  j't  v  .'i  iiiis.st\.l  [iiraii  non 
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6.2  Predictions  on  the  GP/UFA  collection 

To  reiterate  a  point  made  above,  the  predictions  on  tiie  GP/UPA  collection  will  be  made 
independently  of  the  presumed  preferred  direction  of  resolution  of  the  ambiguity.  Despite 
this  independence,  the  theory  is  still  constrained  by  the  data  in  the  following  way.  For 
each  GP  sentence,  there  must  exist  a  grammatical  partial  path  in  the  sentence  such  that 
the  system  cannot  repair  from  that  path  to  the  conect  interpretation.  For  each  UPA  pair, 
there  must  exist  a  single  grammatical  partial  path  such  that  the  system  can  obtain  a  correct 
interpretation  in  both  cases,  because  it  chooses  the  correct  path  for  one  case  and  chooses 
the  wrong  path  but  can  recover  in  the  other  ca.se. 

The  predictions  below  are  grouped  by  ambiguity  type,  with  both  GP  and  UPA  items 
considered  in  each  group.  Many  of  the  predictions  are  illustrated  with  annotated  phrase 
structure  tree;..  A  summary  of  the  results  appears  at  the  end  of  the  chapter. 


6.2.1  Object/subject  and  object/specifier  ambiguities 

We  have  already  seen  how  NL-Soar  handles  two  cases  involving  object/subject  ambiguities 
((148)  and  (149)  above).  This  section  e.xplores  a  range  of  cases  involving  direct  and  indirect 
objects,  prepositional  objects,  NP  specifiers,  and  clausal  .subjects. 


like  (148)  can  be  rather  extended  without  causing  difficulty  (Pritchett,  1992); 

(151;  1/Pa:?)  (a)  Ron  believed  the  ugly  little  linguistics  professor. 

(b)  Ro.n  believed  the  ugly  ii'lle  linguistics  professor  he  had  met  the  week 
befoie  in  Prague  disliked  him. 

As  kmg  as  the  compicinent  relation  assigned  by  the  relevant  verb  (m  this  case  believe)  is 
still  available  m  the  A/R  set,  then  NL-Soar  i.an  repair  the  structure  regardless  of  the  length 
ot  the  object  nounphrase.  The  repair  mechanism  is  ('iily  sensitive  to  tiie  syntactic  structii'C, 
not  the  surface  string 

However,  Warner  a.od  ( iiass  (  1 687)  did  maiidgc  to  pioduce  an  appaienily  length  induced 
garden  patli  etiect,  using,  cxactiv  the  same  ohiecl/siibject  ambiguitv: 

(  !  ,52;  OF?)  riir  girls  I'.clu've  the  man  wtu)  believes  tiie  very  strong  ugly  troys  slru  'k  ihe 
dog  killed  (he  cals. 

Sill  pi  isingiy,  N 1 .  Soc.r  accounts  tor  such  gaiden  path  eftects.  fhe  iiiiei  veiling  niac.ii.i! 
IS  such  dial  all  tlie  striatural  iclatnnis  rcquircii  tor  a  siicce.'-sl ul  rep.ni  i-annol  he  held  m 
d'e  .‘\/!\  .set  in  i!-'.’).  /.hr  ntitn  .iiui  the  hinw  art'  I.lkt'ii  as  t'oriipIt.'ineiUs  td  hrlievr  and 
/)r7i(  I'c.s,  it'specirreh'  When  v/r/iik  aiiu'es,  it  niiisl  Ire  placed  tni  the  i  t>n!|  t  ’  assigncis 
lel.Uion  so  dui!  It  Ilia',  i.ik-'  its  coiuplenieiit.  Helieves  inusl  ,ilsc  still  be  at.ulabic  on  (tic 
laimp  \  '  I'-lalioM,  because  the  complenieut  oi  ht'lieves  must  c.dange  tuuu  ilie  tuow  lo  \[>  luk- 
t  lovt  '.'t  (  I  iiius!  i.'  •  on  the  lariiip  leiatioi.'.  bccaii'a'  tlie  toiiipiianeul  ot  /aa'/fa  ,' 

must  hi'  clumgcil  doin  //ic  nnin  it>  !ne  nuin  kdled  1  t'l  (he  lei.'.iiis  lo  he  suet  esstu!.  Ilr  .A/R 
set  needs  It'  siip[niii  ihe  it'liowing  siiut  luie 
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ASSIGNE-'.J 


comp-V’; 


fv/'  believes],  [v  believe\,[\/i  struck] 


This  exceeds  the  posited  capacity  of  two  nodes  per  relation  (Chapter  5),  which  means  that 
at  least  one  of  the  complement  relations  will  not  be  available.  Thus,  the  interaction  of  the 
repair  mechanism  with  the  limited  syntactic  working  memory  produces  a  garden  path  effect 
because  the  relevant  structural  relations  are  not  available  to  snip.  The  important  factor  is  not 
the  length  of  the  intervening  material,  but  the  interaction  of  the  structure  of  the  intervening 
material  with  the  structure  to  be  repaired. 

Another  kind  of  difficulty  object/subject  ambiguity  arises  with  embedded  subject  sen¬ 
tences  (Gibson,  1991); 


(153;  GP3)  Aarti  believes  that  John  smokes  annoys  me. 

[cp  That  John  smokes]  is  taken  as  the  complement  of  believes  but  must  be  reanalysed 
as  the  topicalized  subject  of  [//>  annoys]  (see  §5.2.3  for  a  discussion  of  the  syntax  of 
subject  .sentences).  A  similar  A/R  set  overload  explanation  can  be  given  for  this  garden 
path.  Believes,  smokes  and  annoys  are  all  placed  on  the  comp-V’  assignors  relation,  and 
assuming  that  annoys  displaces  the  less  recent  believes,  the  crucial  complement  relation 
from  \v'  believes]  to  [cp  that  John  smokes]  cannot  be  repaired.  Unlike  the  explanation  for 
(152),  this  requires  an  assumption  about  the  .strategy  for  managing  the  comp-V’  relation 
(an  assumption  that,  nonetheless,  is  implemented  in  the  system  and  is  consistent  with  the 
other  results  presented  here).  However,  it  is  not  clear  that  a  processing  explanation  should 
even  be  .sought  for  the  unacceptability  of  (153),  because  the  unambiguous  version  of  the 
construction  is  also  unac  ceptable: 

( 154)  ?Aarti  believes  that  that  John  smokers  annoys  me. 

I'hus,  wliile  an  account  is  possible  with  Nl.-Soar,  the  dubious  grammatical  status  ol  these 
embedded  tensed  subject  sentences  makes  the  point  .somewhat  moot. 

Fronted  clauses  with  emhediied  relatives  can  also  pioduce  garden  path  elfects  (Warnei 
A  Glass.  !‘)S7): 


'  155;  (aM)  Hdore  the  boy  kills  the  man  the  dog  bites  strikes. 

l:\ci.'  il  the  clause  the  iloy  f  iles  is  piopciiy  mtcipietcil  ..'s  a  reduced  relative  modilying  ‘he 
man,  when  suikes  ai  recs  and  is  adjoineil  to  the  fronted  cluu'  ■  fas  in  Figure  tc.’ ),  the  le  lev  ant 
snip  operato:  is  not  generated  to  detach  the  man  h  a;'  kdl:.,  lui  tin'  re...soiis  given  above 
111  t'savnplc  (  I  FJ).  Hceausc  ihc  inisniicpretation  ol  iiic  icduced  lelalivc  is  not  iicccssai  iF, 
the  ciilic.il  hii  ior,  the  thenrv  ccmi  llv  picdicts  ib.ii  the  garden  path  w  ill  jieisisi  even  il  ttie 
lel.ihve  chaise  i  ,  disaniliignaleil 

I  I  So  !  Ih'  i  Ol  c  I  he  I  >;  's  kills  die  men  ih.n,  the  dog  bites  sii  ikcs. 

W.uiic!  and  v'l.iss  I'lodir.ed  ..in  inlen.'simu  'wist  on  this  garden  padi  lhc\  in.iii 
(e  L'oni!  \!v  IK  iiidiae  liic  \i<r;  .  '  le.idinr’  ot  iFe  .niihn'iions  Nl’  \s  a  rcsiilp  an  odu  I'.visc 
■  i  .a  ..‘pi.ihie  isinsliui  iion  hccaine  a  g.iH.ien 
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Figure  6.3;  a  reverse  subjectyobject  garden  path. 


(157;  GP.5)  When  the  horse  kicks  the  boy  the  dog  bites  the  man. 

Because  the  theory  can  be  applied  independently  of  the  initial  choice,  we  can  determine  if 
NL-Soar  predicts  a  garden  path  effect  when  the  boy  is  npi  taken  as  the  object  of  kicks.  In 
fact,  it  does.  The  hoy  the  dog  bites  is  ttiken  as  a  complex  NP  with  a  relative  clause  modifier. 
The  man  is  attached  as  the  cornp'ement  of  bites,  displacing  the  posited  object  trace  (via  a 

snip- . we  will  see  other  e.Kamples  of  repairs  involving  traces  in  §6.2.5).  At  this  point,  the 

end  of  the  setiterce  is  reached  but  no  snip  operators  are  genei  ated  since  there  are  no  local 
incon.sistencies  (this  is  true  even  ii'  the  hoy  is  finally  attached  as  the  complement  of  kicks). 
This  situation  is  shown  in  Figure  6.3. 

Objects  can  be  reanalysed  to  NP  specifier  position  without  difhcult  /; 

( 158;  l)P.A4)  (a)  Without  her  we  failed. 

(hi  W'hhcHit  liei  contribetions  we  failed. 

( 159;  UP.A..S)  (a)  The  cop  saw  her 

(h)  The  cop  saw  hei  .sons  employees-. 

'Phe  stiucturc  m  ( !.59j  involves  reanalyias  of  an  ambiguous  jjlural/geni!! ve  NP  from  tibjec! 
to  speei'ier  position;  (  !5H)  involves  a  pronoun  tliat  can  take  ofi|cc(ive  ev  gemtive  c;.i,sc. 
rignre  6. 4  shows  tfu'  detected  meonsistenev  that  arises  a,nd  the  sub.seijUent  repaired  slruciuie 
fi..i  (  !.58). 

A  siniila!  ambiguity  docs  o.nisi  a  garden  path  effeei  (I-'ra/icr,  1978;  Piuchcti,  1988): 

!  !  (lO,  ( il’ii ;  Without  tier  ci'iili iluMious  i aileii  to  nr 


Her  ;s  iiuHaiK  iaken  a.s  ihe  x'hieoi  ol  wahou!,  iJse.!;  reanaiy 'er!  ,is  the  sjiccttier  ol 
Ixi-  hi'f  I  (Oiini’itii.’usl  as  in  (IShi  arn^ve.  '!  lien  the  I'P  l/  j- a  /aa  raei/ iinnions]  is 

adpanuai  lo  |,  ■■  jiii!ed\.  To  perfonn  the  i;ori'-c(  rejcair.  .".mp  ojier.it  ns  mns!  lie  s’cncr.ifed  to 
n  [lu  iM-  j  ,  h/'!  i  bom  sjHai  tii'r  position,  aiul  i  a  move  i  ve  <'  'nlrihinu  ois  1  *  i  i  im  eiPjCi  t  posit  in  le 
/\s  HI  ex.iiiipli.'  i  i49),  these  snips  are  ito!  genetated  sni;.  e  iluo  sie  not  i.e.;)!  tci  the  detcesesi 
iiu;vins[vtc‘ne\  (the  niissuig  MUgeet  oi  g.  /oi/t  /j) 

I  hi!  den  i  si!  ii  e!  I  (,v  vs  can  also  Sh'  err  ate  d  A'ltii  di  mi  i  oioi  ■  it  meni  v  e  i  hs .  d:  !!e  he  tf  P  togi 


;';.T;rrrrr  {M.fiito}  Ivxi  w.  iihfMH  (iU‘  -Mu ipfic 
i  \  .’Diiiut  p.i  '•phv.'s  n 


.ip.vi  i-sU  i  tplir 


Jin. 
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Figure  6.4:  Repairing  an  unproblematic  object/specifiei  ambiguity. 


IP 


convinced  her  professors  h.ue 


Figure  6.5:  .4  garden  pa>h  involving  a  double  complement 

( 161 ;  GP7)  1  convinced  her  profe.ssor.s  hate  me. 

( 162;  GPS)  The  doctor  warned  the  patient  would  be  contagiou.s. 

Convinced  tak.es  two  complenients:  a  nonunal  first  object  and  a  -.entcntial  second  object  (/ 
convinced  her  that  professors  hate  me).  In  (161),  [  a'/'  her  professors  j  is  taken  as  the  first 
coniplemenl  oi  <  on\  oiced.  When  hate  arrives,  i!  is  projected  to  CP  and  attached  as  the 
second  conipleineru,  as  shown  in  l  ignrc  o  .S.  Because  the  two  structures  are  mn  occupying 
till-  same  sin.c  tinal  position,  no  snip  operator  is  gcneraie'l  to  detach  j,v/'  her  pro/cssorsl  and 
the  repair  lads  A  siniilar  explanation  holds  lor  t  162)  The  ddierence  is  that  die  notuinal 
1 '  m  j pic n lent  of  warned  is  opt lonal  (  fhe  doctor  warned  he  woaid  not  tolerate  the  disrupiton ) 
i  'he  patient]  is  nidially  attached  in  tirsl  CiMiiplenK  til  position,  and  (,  /  woahi]  attached 
It!  second  coniplement  position  'Hh*  repair  thus  bills  as  in  1 161 ) 

Aiiothei  kind  o!  gartlcn  patti  arises  when  the  tnsi  object  of  a  doufile  oh)cr  t  constmctioii 
IN  int.dihed  liv  .1  iclaiive  clause; 

:  Itch  (li'a)  jdhn  gave  the  i'o\  tlii-  dm’  bn  a  dollar 

In  I  !(t  ■•  !.  r/ii'  Jm.'  is  iiiitmlh  .itukhed  as  tl'ic  second  cihjcci  ut  y.n  r  ios  is  [>'o|cc(Cii  io  ( 

Inn  rMiHi'''  ,Uiach  in  tlic  cxistnig  stinctuie  Nu  sfii[i  opci.itors  aie  gciicialcil  sua',.-  ih.'!'..'  uc 
no  iiH  .d  inconsistencies  When  n  doUea  arrrees,  n  is  ail  ached  as  the  conudcnicn!  ot  Int.  bcit 
iha!  onlv  e\acci  h;iles  the  garden  path  eltcct  bigine  f>.(i  sfiovv  s  die  iinal  icsiih 
’'■■oi  ,td  ..unbigunics  no  olv  ing  double  of’iects  cause  tldlicnllv 
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Figure  6.6;  a  double-objecUrelative  garden  path. 


(164;  UP\6)  (a)  The  cop  gave  her  earrings. 

(b)  The  cop  gave  her  earrings  to  the  dog. 

Figure  6,7  traces  what  happens.  In  (i64b),  her  and  earrings  ;ire  initially  attached  in  first 
and  second  complement  positions,  respectively.  When  to  'Arrives,  it  is  attached  in  .second 
complement  position.  This  triggers  a  snip  to  remove  earrings,  '^fext,  earrings  is  attached 
in  first  complemern  position,  triggering  a  snip  to  remove  her.  Finally,  her  is  attached 
as  specifier  of  earrings,  and  the  repair  is  complete.  This  is  the  first  example  of  a  repair 
involving  mote  than  one  snip. 

O'ojeci/ohject  ambiguities  arise  when  a  nounphrasc  may  be  interpreted  as  the  object  of 
an  embedded  clause  or  the  .second  object  of  tl.e  main  clause; 

( 16.‘i;  Gf'lO)  Anurag  gave  the  man  who  was  reading  the  book 

Sentence  (165)  does  give  ri.se  to  a  garden  p  nh  cf+^'ct  if  me  amlnguous  NP  (the  hook)  is 
incorrectly  taken  as  ihe  complement  of  the  lower  clause.  A'^  Pritchetl  ( 1992)  poin!:,  out, 
prelerences  for  how  ihe  ambiguity  i,s  re.scdved  vary  cons'derahiy,  so  the  consiruction  does 
nol  |.■,■odaLe  ganicn  path  effects  as  rc!ial)ly  as  niiiny  of  the  other  consti  uciions  discu.ssed  in 
tills  section  However,  that  is  irrelevant  to  applying  ihe  NF  Soar  tfieory.  I  he  tpresiion  is  // 

tlic  imal  Ni’  is  taken  as  the  object  olThe  lower  clause,  does  a  garden  pafti  eite  ,;  ■, merge''  In 

fact,  it  does;  Figure  6.8  siiows  the  result  Mu*  explanation  of  the  elfewt  is  sn.cpc  th  ■  t  iroe 
are  no  ioeal  mconsi‘,le.ncies  i;.>  togger  tlie  .siup  ofKTatoi,  so  no  rcr.iair  t,akes  pl.a  e 

Noi  ail  obiect/otrjcct  arnhigiuties  lead  to  garclen  iraih  ettecl.s,  (.  hiprolalemat  ic  oh 
i< ft/uliject  amiugiulies  may  be  constructed  with  coi'iniemcnts  (sf  noniinals  \(.iibs‘,rn,  1991  i 

(  ItiO;  i  (’.a  n  Cl)  t  he  numslef  warned  the  president  ot  tile  liange! 

ids  I  he  nniMsier  uauie'i  the  jires  ueih  ot  llie  repuhlK;  id  die  viangri 

III  (  Ifrf'la),  I,.,  .  >/ dif'  'vp.’id/a  I  mav  ta,'  nutiaily  ariactieil  as  .s  i  ongacment  oi  \  \  ni\l\ 
Vv  hen  i  /s’  ct  hir  ati’n'i  ’  1  -n  1 1  ves  .uu!  is  atlaehed  in  -  he  sanac  s  o;H[;!cn.en(  posii  ion.  i!  ti  iggers 
.1  oiip'  oj'i.'i.itoi  10  remow  \ri-  ■’/  siu'  nfc '.uh:n:\.  svluch  is  dien  .it!aeliei.i  .is  ,i  compkaiicril  (,i! 
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Figure  6.9:  Repairing  an  uiiprobietnatic  complenient/adjanct  ambiguity. 


6.2„2  CoEU-plenient/adjisiict  ambigoities 

Incorninj?  phrases  can  tvitcn  be  interpreted  as  either  compietnenls  or  adjuncts.  Consider  the 
unproh'lenialic  senlences  in  167); 

(  167.  L’PAHj  (a)  Is  the  Idock  on  the  lable'^ 

(h)  Is  the  Idock  on  ihe  iae!l;’-  red'.' 

The  prepoivtitona!  phrase  on  ihe  ttitie  may  I'C  inlcrpreted  as  a  luodtliei  ol  hitnk  or  llie 
Lornpietiiciit  ol  i.s.  Assume  th;i!  Ihe  conipl'."ini‘t!i  a'tachnicn!  is  pursued  hist.  Figure  6.6 
shows  how  N!  Soar  repairs  the  struciiue  wlieu  red  anives,  so  dial  the  I'F  b/.  on  ihc  table] 
IS  reanalys  'd  a^.  a  inotliiier  ol  hl<>ok.  Rro  is  projeeie.,!  :«.•  .i,t!  .\I’  aud  ailaehed  in  eoin[iIemenl 
[rosiiion,  whieli  tiiggt'rs  a,sm|.i  operalor  to  deiaeh  i  ,-,"  at  dw  ho.\  |,  ni  tiie  same  manner  that  we 
ha\(‘  seen  abox  e.  ( lin  e  dsiai.  lied,  (/•/.  on  tiv  rab!>  \  s>  simi',l\  ad)oined  to  | blot  k  |.  .A  sinulai 
explanalii'ii  piv  ^il^.ls  the  aeeeptahiiitx  oi  complemeiil  eiause/subiect  lelativc  amluguilics'’ 
i  X  iibs’ >i) .  I  '.'6  !  ' 

(ItiS;  Id’Wi)  (.i)  ioliii  told  ihe  man  that  Matx  kussrai  Bill. 

(b)  lohn  told  die  man  (liat  kissed  Maiv  lhai  Bill  saw  I’liil 

llowexei.  odiei  eoiiip!emeiil,'ad|(iiiel  ambieuilies  do  e.ur^e  dillK  ullx.  Crain  and  Sieed 
(11,111  (  toniui  lh.it  eoni(>leiiieiil  r  ian.se/obi'sel  lelalixa- .iintegmlu-s  pnaiiu';  ■.  isieii  p.ith 

elKs  Is: 

i  1  ('6,  ( ,l'l  1  I  i  In;  I'sx  eholopisl  told  itu'  xx  ilt:  that  he  xx  as  I).  .  ’o,.  i , a  mi'iie  \x  ilh  lo  ksix  e 

i’re|);tsi!io!ial  pinasc  ;uguiuenl/.kljlii  i.'i  imi'iLunlie'  niav  also  produi.a-  iMiden  [' ah  eltei  ls 
i  111  SOlllKisI  lo  (  I  (1  I  al'oxe  )  ,(  is'):  .11,  I'  "’  I 

i  ill'  n  1 1.1 11  I  ol! hi  in'  ,i'o  ii  Ji.'d  I'.Ml n  fl I'-i  !o|  lou  ini'  I 'i  i iOi.’i I  ■-  i  !  ‘hi  i  .o-.i im ■ ,[ loi i  li . .li  i  i m-  , ,>1,  1 1,  ,■  p,,,  J 
ti  n  f  I  I  [  lie  i  I  X  1  '  !l6l  !  I  IICIK  n  .'(C  I M  t'hcnl 
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( 1 70;  GP12)  I  sent  the  letters  to  Ron  to  Teresa. 


Unfortunately  NL-Soar  cannot  account  for  these  garden  paths,  because  the  repair  succeeds 
in  both  cases  as  in  ( 1 67)  above.  However,  the  complemenUadjunct  ambiguities  exemplihed 
by  (170)  seem  to  differ  in  their  acceptability; 

( 171 )  ?Michael  put  the  toys  in  the  bag  into  the  closet. 

Such  examples  demonstrate  the  complexity  of  PP  attachment  phenomena. 

Crain  and  Steedmaii  ('985)  al.so  found  that  a  garden  path  aii.ses  when  the  clause  is 
interpreted  as  a  relative  clause  hut  the  complement  reading  is  required  (they  induced  the 
relative  reading  through  a  contextual  manipulation): 


(172;  G1’13)  The  psychologist  told  the  wife  that  he  was  havii, g  trouble  with  her  husband. 

Nl ,  .Soar  does  account  for  tlii.s  g.uclen  path.  Hveii  9  1/v/'  her  imshatid]  is  attached  as 
the  complement  ol  !/•'  wuiii,  die  entieai  sniii  ojrerator  reijuired  to  detach  the  clause  from 
fv  vei7('|  IS  not  generated. 

(ribson  (ld‘)|)  ptunls  out  the  lollovvr 'g  m  •■ohicm.itic  ambiguity  involving  eornple 
iiieiits  o!  nomiiuils: 


(  ! 7U  l)f>Aioi  (;i)  t  he  report  that  one  president  .sent  tr>  us  helped  us  make  the  decision. 

(h)  i  lie  tejxiM  tiiai  llie  juesideat  scut  the  troops  into  combat  depressed  me. 


I  he  elaiisi.'  llhU  tht  /nesiilt-  i!  .sera.  iiiav  la*  taken  .is  the  eoinpienieiil  or  modilier  ot  re/>i-/!, 
.iiiil  iieithci  iiiieipiel  ioosi  v  .arses  dillieiillc  .NI  ,5o.n  !,mI\  i>>  predict  this.  It  tiie  clause 
IS  all. H  lied  .IS  (!  c  eoiuplenK  iit,  llie  lelec.iiit  snip  (a-  bie.ik  ttie  i  oiiipien.eiil  link)  is  not 
'.'I'iier.iled  w  iieii  die  nussiiu,'  oiriect  ol  ve/.r  a.  detc'clr'd,  t'eisiu'.e  die  relation  is  luU  local  to  VI’ 
Iw  \(7if!  .Sinai. ul\,  iKiihiFig  triggers  ilic  snip  to  break  tiu;  .u1)11!K  tioii  lel.itioa  it  tiu*  clause 
IS  fiisi  aiiactieit  .o  a  iiioditiei 

.A  nod  a*  I  k  md  ol  iin|'iobleni.i'r,  eoitipini  lent. did  .im'i,  "nts  on  ol\  i.  \  adga  1 1\  fs  ih.il 

Mi.ic  be  !.ik(  II  .IS  pi eda  .lie  (,  oinpleineiUs  Ol  iiioditieis  ol  iiKa>iiiing  lu  ains 

(  !  .  I .  I  !’  \  III  ui  I  I  he  b-  n  g‘  >1  l.il 

d'l  1  Ik*  h'.o  eol  l.il  r.iae  loi  his  pet  snake, 

I  i'Miie  ti  !(l  sluo'.s  hov.  \1  So. II  I'ep.urs  sia  !i  i  oust!  tu  I  ions  j.,,,  |s  .lU.u  lied  .a.  iiit' 

.1  a  hf  'f  Ilk  111  o!  I  i  g,  ’  / 1  1 1  i  kge  ling  1 1  a*  snip  kd  |  , /o/ 1.  \\  hail  is  dkii  .uta  a  ne.  i  a  >  1  \  "u-.  c  | 
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Figure  6.10;  Repairing  an  unproblematic  predicate  complement/modifier  ambiguity. 

6.2.3  Main  verb/reduced  relative  ambiguities 

We  now  explore  variations  on  the  reduced  relative  garden  path.  Consider  the  canonical 
example; 

(175;  GP14)  The  horse  raced  past  the  barn  fell. 

Figure  6. 1 1  the  shows  complete  .structure  for  the  main  verb  interpretation  of  The  horse 
raced  past  the  ham.  The  inflectional  features  that  head  the  IP  phrase  adjoin  to  the  zero- 
level  verb  node,  leaving  a  trace  in  head  of  IP.  (This  joining  of  inflectional  features  to  the 
verb  is  assumed  in  some  form  by  most  syntactic  theories;  e.g.,  McCawiey  (1988)  calls  it 
Tense-hopping,  and  assumes  an  adjunction  structure  like  the  one  presented  here.) 

Passive  forms  like  driven  are  untensed.  Figure  6.12  shows  the  reduced  relative  reading 
v)f  The  horse  raced  past  the  barn,  which  uses  the  passive  interpretation  of  raced,  in  tins 
structure,  the  inflection  plays  no  role. 

Consider  the  repair  required  to  successfully  parse  (175).  The  structure  in  Figure  6.! ! 
must  be  transformed  intc;  tnc  structure  in  Figure  6. 1 2.  This  involves  removing  [/v'p  the  horse\ 
from  spec- IP  position,  and  snipping  the  adjoined  inflectional  features.  When  /e//  arrives 
and  is  projected  to  VF,  die  only  place  it  may  attach  is  di  complcinciU  position  of  F.  This 
laroiluces  an  inconsi.stency  local  to  the  IP,  as  shown  in  Figure  G.13  However,  tnis  fails  to 
generate  ail  tiie  required  .snips.  Although  the  spec-lP  relation  is  local  to  the  IP,  the  csiiciai 
mfiedioi]  .'idjirnction  is  not,  so  the  passive  reading  cannot  be.  recovered. 

The  intervening  modifier  (/-/.  pact  the  barn]  is  irrelevant  to  this  explanation.  Thus,  NT, 
Soar  ra.  iiectly  piedicts  the  exi.sfcrice  ol”  very  .short  rediiecd  relative  garden  paths  ( Kurtzina.n, 
Abney,  1959)’. 

’( *ti(‘  fidssibiluv  tier  ■  .  m  allow  [  v  snaAi  lo  attach  I'S  the  head  ot  the  exiNiiiii;,  Vi  i  ]  v;>  fhiawd]),  wliich 
wMiild  thi  II  iii.ikc  llic  Vi  laihci  than  tin-  it',  the  loculitv  u!  the  iiiCoiiMstencv,  Itut  ulhtwinsj  such  iiiiKS  extends 
the  i.iiiye  id  cMstiny  coii-.inict,i ve  [)ioi..esses  '.lie  hasR  asaiinplion  ut  siinpie  dcslniclive  |.■‘|)an  is  that  the 
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■IGURf.  6. 1 3:  The  main  verb/reduced  relative  garden  path. 


(176;  GP15)  The  boat  floated  sank. 

The  exulanation  also  extends  to  ditransitive  verbs  (Rayner  et  al.,  1983): 

( 1 77;  GP16)  My  friend  sent  the  flowers  smiled  broadly. 

Embedding  the  relative  clause  gives  rise  to  a  similar  kind  of  difficult  ambiguity  (Gibson, 
1791): 

( 1 78;  CP!")  The  dog  that  was  fed  next  to  the  cat  walked  to  the  park  chewed  the  bone. 

In  (178),  walked  must  modify  cat,  but  unlike  (173),  a  different  NP  is  in  subject  position 
(dog).  Again,  however,  when  chewed  is  projected  to  VP  and  attached  to  the  main  clau.se 
IP.  the  required  sntp  opci.itors  are  not  generated. 

Not  all  main  verb/reduced  relative  ambiguities  produce  garden  path  effeets  (Pritchett, 
1992;  Gibs.!n,  1991): 

( 179;  UP.M2)  (a)  The  defendant  examined  the  evidetice. 

(h)  Tlie  delendant  examined  by  the  lawyer  shocked  tlie  jury. 

d  he  rcdceed  relative  readi.ag  m  !l79b)  is  readily  avadabie,  in  stark  contrast  to  (  !75) 
Nl  .  Soar  handles  ihe  repair  m  llie  tolluwmg  way.  Ihc  incoming  laeposilioi;  h\  jjic'iect  ^  to 
PI’  and  ;id)oins  to  exiintined.  just  as  pa.M  ;idniins  to  raced  in  !  173)  The  ciucial  ddiciencc 
IS  dial  CMunitu’d  otiligator'dy  liansilivc.  This  means  dial  |v-  exiutnned]  is  nussnig  an 
object,  vvhicli  must  n  )m!aily  iinmciliatcly  tollow  the  veib''.  lagnic  (>  M  shows  a  snapslio! 

c  X  !.3li  n  L’ set  L  ‘  tfistr  iK't  I  VC  pH  ttvsbCs  I  [>lus  snip }  au*  MiHuicni  Suib.iinc>ut  fht:  htiiit  [(/■  !i  ill  ddd  b 

pi hv  Ihus  Ihc  tnct.?nun*.’  verb  '-vtil  cicatc  ils  (j'ah  |>ii))ccUt)ri  GUhfN  ihan  ancLiniic  a^  ilic 

t)l  .1  '.liiftTfiii  Ncrb  ’-,  |)io|CL!n‘n 

'  lliis  .uliiKciA  V  ff4,|i;ifTnU’n!  ^SitmcliiriCs  aHribnied  16*  (  pm:  a ssipilmu’iIM  .  ^lf)  (“a  il\  be  seen  in  fh-*  a  ’vk 
V.  auiiit^bs  '.i\  examples  snetj  .i.v 
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IP 


IWX,  examine-d 


Figure  6. 14:  An  unprobiematic  reduced  relative  ambiguity. 

at  this  point.  Two  snips  are  generated  local  to  the  VP:  a  snip  that  removes  the  adjoined 
inflection,  and  a  snip  that  detaches  the  VP  from  the  I’.  Now  [vp  examined],  in  its  untensed 
configuration,  projects  to  a  CP  and  attaches  as  a  reduced  relative  to  [/,//  defendant],  and  the 
repair  is  complete.  When  shocked  arrives,  it  projects  to  VP  and  attaches  to  the  existing  IP, 
and  a  new  trace  is  established  in  head  of  IP  coindexed  with  the  new  main  verb. 

Becau.sc  the  existing  VP  structure  for  examined  is  left  undisturbed  during  the  repair 
(with  the  exception  of  the  removal  of  the  adjoined  inflection),  any  adjuncts  to  the  VP  may 
be  carried  over  without  problem: 

(182;  UFA  14)  (a)  The  defendant  carefully  examined  the  evidence. 

(b)  The  defendant  carefully  examined  by  the  prosecutor  looked  nervous. 

6.2.4  Lexical  ambiguitie.s 

ihi.'-  scctioii  cxaniine.s  a  range  of  struclural  ambiguities  that  arise  from  lexical  syntactic 
aiTihigufty  I’hc  concern  here  is  not  with  semantic  ambiguity,  though  of  course  semantic 
arnhiguity  is  nearly  always  involved.  Rather,  .he  focus  is  on  syntactic  ambiguity  that  is 
synl.u  ticallv  re.soived.  Bscause  ihest  constructions  constitute  about  a  tliird  of  the  total 
collection,  they  are  fuith'.'r  divided  into  more  .nianageablc  suligroups, 

;  I'Un  lavvvri  rx.iiniii .it  .vitii  liasif  ilie  ilelciulanl 

f  Idw'cm"  i,,i  itic  c.isc  of  nt-.i'i  V  r>!\.  iht-  pfclcnriK-f  :s  it-vri'.n1: 

(  !  K  i  )  ( ,i  I  '  I  lie  i.iVv', c'  il!>;-  delemtari!  ihal  wc  saw  vxrsleril.iy  m  the  (.'ouithiiuse  wilh  hastf 

I  !>:  t  i SV  i  ' )  I  hf  Livvcr  i-xancncil  wnh  iiasif  tho  ilctciufinl  i.h.il  \v  c.‘  sav,  \  csicnl.iv  in  itir 
Hit  1  h.oi:'-;!’ , 

!i!  i  8  U)  I,  Vi/wio'/r.  /  rnnst  h<-  .nl  troiii  a  rcdiiLCil  relative  hai.  k  In  a  iiiaui  \  ei  !>  Nl  Si  su  tails  li  >  ii.inille 

till  .  lepaii ,  nri.  .iiise  Uu  iiiiienscii  fxtnuitr,!  ,  .innot  heer.se  a  hate  oh|eel  NF 
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Figure  6.15;  Repairing  an  unproblematic  nour/verb  ambiguity. 

Noun/verb  ambiguities 

In  the  following  constructions,  the  ambiguous  region  may  be  taken  as  a  compound  noun 
( 1 83a)  or  a  noun  followed  by  a  verb  (183b)  (Frazier  &  Rayner,  1982): 

(183;  Ul’AlS)  (a)  The  warehou.se  fires  kill  numerous  employees  each  year. 

f  b)  The  warehou.se  fires  numerous  employees  each  year. 

Figure  6. 1 5  shows  how  NL-Soar  can  repair  the  compound  noun  structure  to  be  consistent 
with  the  interpretation  required  in  (183b).  As  dc.scribed  in  §3.8,  multiple  syntactic  senses 
of  lexical  items  are  acces.sed  in  parallel  and  placed  in  the  A/R  set,  where  they  may  generate 
then  own  projections.  The  first  frame  of  Figure  6.15  shows  the  structure  ju.st  before 
i/v7>  numerous  employees]  arrives.  I'hc  nominal  form  o\  fires  has  adjoined  to  |a'  warehouse] 
to  form  a  compound  noun,  and  the  verb  foi  ni  has  projected  to  a  VP  and  tensed  IP.  When 
employees  arrives,  it  atiaches  as  the  complement  of  |v"  /(ms|.  At  this  pomt,  the  two 
competing  senses  ot  lues  are  both  incorporated  into  the  utterance  model,  because  each  is 
attached  to  another  won!  I  his  tuggers  a  siiqi  (ipcrator  to  detach  fires  as  a  noun,  leaving  the 
simple  NP  l.v/'  the  warehouse].  Next.  j.v/.  the  warehouse]  is  attached  in  subject  position  of 
llic  IP  projcc  terl  fioin  fires  as  a  veil),  and  tlie  rejiaii  is  complete 

.A  siniilai  amf'inmtv  aiises  with  itic  auxiliary  run  ((iibsun,  IPAl  i; 

I  IH-f,  I'PAic)  la!  I  he  paint  can  fell  down  the  stairs. 
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(b)  7'he  paint  can  be  applied  easily  with  a  new  brush. 

In  this  case,  can  forms  a  compound  noun  with  paint  and  projects  an  IP.  When  he  arrives 
and  projects  to  VP,  it  attaches  as  the  complement  of  [/'  can],  triggering  the  relevant  snip, 
and  the  repair  proceeds  as  in  example  (183)  above. 

Noun/verb  ambiguities  may  be  preceded  by  adjective/noun  ambiguities  without  causing 
difficulty  (Milne,  1982;  Pritchett,  1992): 

(185;  UPA17)  (a)  The  square  blocks  the  triangle. 

(b)  The  square  blocks  are  red. 

Figure  6.16  shows  how  Nl.-Soar  repairs  the  main  verb  reading  of  blocks  so  that  it  is 
consistent  v/ith  the  NP  structure  required  by  (185b).  Square  projects  both  AP  and  NP 
nodes,  and  blocks  projects  to  NP  and  IP  (1).  The  square]  attaches  as  the  subject  of 
[//>  blocks].  When  are  arrives,  it  protects  to  IP,  and  [/v/-  blocks]  attaches  as  its  subject  (2). 
This  triggers  snip  operators  to  remove  structure  attached  to  the  verb  sense  of  blocks,  that 
is,  square]  (3).  Next,  [ap  square]  adjoins  to  [/,-  blocks]  (4),  triggering  the  removal  of 
structure  attached  to  the  noun  sense  of  square,  that  is,  the]  (5).  Finally,  [^et  the]  is 
attached  in  specifier  position  of  square  blocks],  and  the  repair  is  complete  (6).  This  is 
another  example  of  a  repair  requiring  multiple  snips. 

Some  noun/verb  ambiguities  do  cause  difficulty.  If  the  unproblematic  ambiguity  in 
(185)  is  followed  by  a  reduced  relative,  the  result  is  a  garden  path  (Milne,  1982;  Pritchett, 
1992): 

(186;  GP18)  The  building  blocks  the  .sun  faded  are  red. 

Blocks  is  taken  as  the  main  verb  (as  in  (185),  and  ,sun  as  the  complement.  When  faded 
arrives,  it  can  be  attached  as  a  reduced  relative  modifying  sun.  Once  are  is  projected  to 
an  IP,  no  additional  attachments  are  possible  (the  nominal  sense  of  blocks  cannot  attach 
as  the  subject  of  are  at  this  point  becau.se  the  NP  blocks]  is  not  adjacent  to  |//.  r/o'j). 
Furthermore,  there  are  no  local  inconsistencies  to  generate  a  snip,  so  the  repair  is  never 
initialed. 

Another  kind  of  difficult  noun/verb  ambiguity  does  not  involve  a  reduced  relative 
(Milne,  1982): 

( 187;  (iPI9)  The  granite  rocks  by  tlie  seashore  with  the  wave.s. 

.Assiiiiimg  ill  this  c;ise  tfia!  tlu'  noun  iiiteiprctiition  of  rocks  is  pursued  first,  the  entire  stmig 
mav  be  ttssigned  a  well  loriiied  slructure  as  a  complex  noun  [ilirase: 

(,vc  ih.c  x^ronitc  rocks  |/./.  l>\  1^/,  the  seashore  I,-./,  with  the  uYO'e.sj  j|  I 

1  hos,  no  iiK onsisteiicies  aiise  to  generate  the  snip  ofieratoi.  fins  anttlysis  is  eonsistent 
U  illi  the  inlunoe  feelnig  thal  tfie  problem  wilfi  (  i9(.t,l  is  not  so  omeli  uiigraiiiiiialK'ulilv  as 
iiieoini'leU'iiess.  f  xample  ;  ItS:')  prosiuees  a  similiu  effect 
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Figure  6.  i  7;  a  garden  path  involving  a  noun/adjcctive  ambiguity. 


Noun/adjective  ambiguity 

This  section  considers  a  range  of  ambiguities  involving  words  that  can  be  interpreted  as 
nouns  or  adjectives.  We  have  already  seen  in  §3.3.3  how  NL-Soar  could  repair  the  basic 
noun/adjective  ambiguity; 

(188;  UPA18)  (a)  The  square  is  red. 

(b)  The  square  table  is  red. 

Complement/adjective  ambiguities  (Pritchett,  1992)  are  also  easily  handled; 

(189;  UI’A19)  (a)  I  like  green. 

(b)  I  like  green  dragons. 

In  ( 189),  green  projects  to  both  NP  and  AP.  [ni>  green]  attaches  as  the  conrnlcinenl.  When 
dragons  arrives,  it  is  projected  to  NP,  and  \/^^p  green]  adjoins  as  a  modirier.  This  triggers  the 
snip  operator  to  detach  the  nominal  sense  of  green,  and  |w  green  dragons]  attaches  as  the 
coniplenient. 

When  an  easdy  repaired  ambiguity  like  ( 188)  is  followed  by  a  relative  clause,  a  garden 
path  effect  arises  (Marcus,  1980): 

(  100;  (iiu.O)  Fhc  Russian  women  loved  died. 

The  Russian  is  first  taken  as  an  .NP,  then  unproblematically  reanalysed  as  The  U/<  Russian] 
wtnnen,  us  ini,  1 88).  Next,  loved  projects  to  VP  and  tensed  IP,  and  l/vr-  The  Russian  |  women 
utl-u  hes  as  the  suhiecl.  When  Jrv/  arrives,  it  may  attach  as  a  VO’  (o  the  F  compienient, 
but  tins  only  succeeds  in  inggering  a  snip  operator  to  detach  Ivc  u/ved]  (Figure  6.17).  No 
luiUicr  :itiuc|imeiits  arc  possible  (li/.  hived]  caniiol  attach  as  a  reduced  relative  in  its  tensed 
conhgmalion,  as  discussed  earlii  i )  No  fiirlher  snips  are  generated  in  parliculai,  no  snips 
,i!e  geneiated  to  split  lire  innuiphrase  |,v/'  Hie  Russian  women]  As  a  icsull,  the  rcpaii  fails. 

.A  well  known  ilass  of  auilugiiuus  au)eelive  nouns  tio  cause  gaidt.ii  palli  i:ltects.  eviui 
when  iisi.-d  m  sinqde  consiruclions  like  i  188)  above  iMiiiic,  1983); 
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(191;  GP21)  The  old  train  the  ehildren. 

(Another  familiar  example  is  The  prime  number  few.).  Pritchett  (1992)  points  out  that 
the  crucial  distinguishing  characteristic  of  the  problematic  examples  is  that  they  involve 
derived  norninals.  That  is.  the  nominal  sense  of  these  words  is  derived  from  the  adjective 
by  a  productive  process  {the  virtuous,  the  true,  the  free,  etc.).  The  resulting  norninals  appear 
in  restricted  contexts: 

(192)  *We  saw  an  old  yesterday. 

Pritchett  suggests  that  these  words  are  repre.sented  in  the  lexicon  only  as  adjectives,  with 
the  nominal  interpretation  generated  by  some  on-line  process.  In  NL-Soar,  this  means  that 
the  nominal  derivation  is  a  process  of  projecting  an  NP  from  the  zero-  level  A  node  retrieved 
by  lexical  access.  Now  consider  what  happens  in  ( I9l ).  Once  the  NP  [t^jp  the  [4/>  old)  train] 
is  formed  and  old  has  been  projected  to  an  AP,  the  repair  will  fail.  The  adjective  phrase 
<■.»/<:/]  may  be  detached  as  in  example  (188),  but  no  additional  snips  are  generated  to 
destroy  the  projected  adjective  phra.se,  which  is  necessary  to  make  the  zero- level  A  node 
available  for  the  nominal  derivation^. 

Ford,  Bresnan.  and  Kaplan  ( 1982)  used  another  derived  nominal  to  produce  a  garden 
effect  with  a  predicate  complenicnt/subject  ambiguity; 

(194;  Gr’22i  The  boy  got  fat  melted. 

In  this  case,  tlie  AP  projection  oi  fat  can  be  immediately  attachei^  as  the  complonient  of 
got.  When  melted  arrives,  it  also  attaches  in  compleineni  position,  triggering  the  sii'p  of 
\,\r  fot\.  But  melted  requires  the  derived  nominal  reading  oi  fat,  and  the  zero  level  A  node 
is  no  longer  available  tor  the  tiei  ivation.  The  repair  thus  fails.  The  eomplement/snb|ee; 
anibig,uity  itself  is  not  the  source  oi  the  problem,  as  deuioiisl  rated  by  the  loliow  iiig  sentences 
(Prilehelt,  !992): 

(196)  (a)  The  bov  got  the  cake, 

(b)  file  boy  got  the  cake  bakeil 

( 'onstruciions  like  (PfS)  ran  t'c  easily  h.aiullcd  .is  (.Icscribed  earhei  m  the  section  on  oh 
ject/sub|ect  amtriguilies. 

Nl..  Sfiai  i  an  still  liandK'  llic  uiip.'nlileiM.itie  aiiitni’uiiv  hcln'.v 

I  i')!,  I  M'.A.’O)  (a;  I'Ik:  okile.iili  llie  viniin’ 

(1)1  l  lir  i)icl  [)iiilcssOi  (.■ai'tu".  iitleii 

Hmvs.-m-i  il  ni|iiri,  .1  ilelax  i;i  I'inieOifi!'  the  .At'  mini  itieie  e  m  nnethini'  In  ehi.  li  the  pi ;  i|!'i  I  ii  ui  i  .m 

ail. nil  (in  (I'ttlii  i  It  is  iinl  '-.iiqii  isiiit'  fliat  tli,'  m  niiiiia!  piuieitma  i*.  iie.l  m.ule  uanl  the  leiiirsl 

I  kina  IK  Is  II ,  lull  il  i  Ies‘  ele.u  e'cieih  lin’.'.  tins  '.hiiiild  iirteiaet  v.ilh  the  A  I  ’  pn  ijeeln  iii  Nes  ei  theles-  this  eiel.e 
til  les  IK  '!  sei  1'  ir  s: t'eiiiia  i  anise  inimedi  ie\  i  U  iiilei  pi  ei.it  mn .  -,mee  im  .tn,ti:hnie.iits  I) '  i  ilhei  ie  sk.  .ii  pi  i  Jps,  in  ne, 

.  i  I! I la  I  '(■  il i.iik;  111  .111',  e vriil 
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FIGURK  6. 1 8;  Repairing  a  pronoun/deterniiner  ambiguity. 


fhai  ambiguity 

The  word  tiui!  can  play  a  role  a.s  a  delerminer,  protioun,  or  complementizer.  Conside-  ( 1 96): 

{  196;  UPA:  I )  (a)  1  know  that, 

(h)  I  know  that  girl. 

In  (  I9()a),  that  is  an  NP,  wliile  in  ( 1 96b)  tihii  is  a  iletermincr.  Neither  sentcrice  causes  any 
ddliciihy.  Fiinire  6.  IS  .^l.ows  how  NI,  Soa;  liandies  this  repair  All  three  syntaetie  senses 
o\  that  are  retrieved  from  trie  lexicon,  and  tiie  pro|ecterl  pronominal  node  attaches  imlialiy 
as  the  com[!lemenl  of  know.  When  vjii  arrivc's,  k;,.,  diti!\  atlachi-s  as  its  spi’ciliei.  1  his 
triggers  the  smp  ol  |\7'  ihai]  Irom  coinjriemeiit  position,  and  liiially  kv/'  that  pirl\  beconu's 
the  coiiipleiiieiit 

siinilai  uiipiohlerii.ilir:  ambiginiv  nnohes  the  compleiiienli/er  reading  ol  iluir 

(  1 9  /;  \  ii'A.’j  )  ( a  1  1  know  !hat . 

(to  I  know  ihai  tlogs  shonid  plav 

{  l‘.'/i  IS  h.indleu  111  r-sseiilialh  llu-  same  nianiu’i  as  (  I'R)),  When  slunih!  pion'eis  lo  an  IP, 

I i/og.s )  .iil.H-lies  as  il.s  s[ieci!ii  I  Next,  k,- <7o.e.s  .v/ioir/i/i  attaches  as  ilu.'  coin[!leiiienl  o! 
k  /  k  tihitW  riiis  Iiiggeis  lilt’  snip  ol  j lhal\  and  k /■  thai  ke  aoi’.s  .shoaldW  her  oines  llie 

r  ( )mpk‘PK'iil 

Sill  |ii  isingK,  ;/m/  ai  nbignilK’s  can  also  jn  i  iriuri'  gait  ten  ;  >  ilh  rdt  crt  s 


I  kkS,  i  ilC!  ':  l-Sefoie  siu'  kiieu  itial  slie  went  to  ll!'  slou- 
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In  (198),  that  is  initially  taken  as  a  pronoun,  then  reanalysed  as  a  complementizer  for 
[ip  she  went]  in  the  manner  described  above.  Next,  [pp  to  the  5^^^]  attaches  as  the  comple¬ 
ment  of  [v<  vv<?»?l.  The  processing  completes  with  the  well-fo  red  CP, 

(199)  [cp  before  [ip  she  knew  [cp  that  she  went  to  the  store]]]. 

No  snips  are  generated  since  there  are  no  local  inconsistencies,  so  the  correct  structure  is 
not  uncovered. 

Garden  path  effects  can  also  be  produced  with  complementizer/deterrniner  ambiguities 
(Gibson,  1991 ;  Pritchett,  1992); 

(200;  GP24)  I  saw  that  white  moose  are  ugly. 

In  (200)  the  singular/plural  ambiguity  of  moose  interacts  with  the  ambiguity  of  that  to  cause 
the  difficulty.  That  white  moose  is  initially  interpreted  as  a  singular  NP  and  attached  as  the 
complement  of  saw.  Are  arrives,  projects  to  IP  and  CP  (it  cannot  attach  as  the  complement 
of  [c/.  that]  because  the  phra.scs  arc  noi  adjacent),  and  attaches  in  complement  position  of 
.venr.  This  triggers  the  snip  of  |a'/>  that  white  /nofn-c],  but  fvp  that  white  moose]  cannot  be 
atttiched  as  subject  of  |/p  are]  due  to  a  number  agreement  violation.  No  fuither  snips  are 
gcneratci,!  and  the  repair  fails. 

Another  diflicult  coniplementizer/determiner  ambiguity  arises  when  a  sentence- initial 
that  may  he  interpreted  as  a  delenniner  or  as  a  complemcnti/er  for  a  subject  sentence 
(Gibson,  1991); 

(201 ;  (iP25)  I  hat  cotlee  tastes  terrible  surprised  John. 

ilia!  coffee  r.s  interpreted  as  an  Nl’,  which  can  then  seive  as  the  subject  ot  tastes  ternhh  . 
Wlien  sioiuiscil  ai liv  es,  it  eaunol  take  the  iiiitnil  clause  as  a  Mib|ccl  sentence  because  siibieci 
sentences  must  have  ovi-ii  coin()leiueiiti/ers; 

(2().’i  bSaiah  lett  bothered  .lili 

I  he  niilv  oplnni  is  to  piu|ecl  sio i>ri.\f,l  lo  \'i’  and  attach  it  as  tiu'  coinplenient  ul  ihe  cMstiii;’ 
IP.  l’hl^  leiuls  iK'w  here,  in  p.ulicnl.u.  il  sloes  not  lead  In  itie  ieL|nned  snip  nf  |  ,  ilh.al  linin 

\ \i' thill  Cl  >ttt'('\.  Iheiei'aii  thus  I, ids. 
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CP 


PP 

by 

given  gifts 


Figure  6.19:  a  main  verbAiuxiliary  garden  path. 

Figure  6-19  shows  how  the  garden  path  effect  ari.ses.  Have  rriay  be  interpreted  as  an 
impenitive  main  verb  {have  heads  a  VP)  or  an  auxiliary  that  begins  a  question  (have  heads 
a  CP).  The  question  interpretation  is  pursued  when  l,vf>6ov.v]  attaches  in  spec-IP  position. 
Given  projects  to  a  VP  and  attaches  as  the  F  complemeiil,  and  j,v/.  gifts]  becomes  the  hrst 
complement  of  (r'  givenj.  Next,  !/■/•  /n)  adjoins  to  Iv-  given].  In  this  configuration,  given 
IS  missing  an  obligatory  second  complement.  This  may  trigger  a  snip  of  the  VF  from 
1(7’  have  i//.  |,v/.  ihe  /)ov,sj!|,  but  It  does  not  tugger  the  snip  required  to  remove  |,v/-  the  Aov.si 
*rom  the  interrogative  structiuc.  Furthermore,  the  detached  U/  given]  cannot  attach  as  the 
complement  of  F/'  iiave]  because  t'lc  phrases  •ue  not  adjacent.  As  a  result,  the  repair  failsV 


Singalar./jhiital 

We  have  alteads  seen  some  e  vamplcs  in  which  number  amluguiiv  plavs  a  role  Number 
amiuguitv  ilsell  is  not  nccessanlv  a  [uoblein  (  Kuit/man,  lO.Ss); 

(  J()s,  l  i'.A,’  M  (ai  i'lie  sheep  seem  \eiv  ti.ippv 
(b)  riie  sheep  seems  veiv  haj’pv 
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Because  ihcre  is  no  structural  ambiguity  here,,  there  is  r,o  need  to  lepresent  the  ambiguity 
with  multiple  NPs.  NL, -Soar  simply  handles  this  ambiguity  at  the  level  of  syntactic  features, 
like  nearly  every  other  approach  to  natural  language  processing  (e.g.,  unification  grammars). 
The  head  of  [  the  sheep]  contains  a  set  of  syntactic  features,  including  number,  which 
may  take  on  multiple  values.  These  features  are  restricted  as  required  by  agreement  checks. 
When  [\r  the  sheep]  is  attached  in  spec  -IP  position  of  hr  seems],  spec-head  agieement 
ensures  the  number  will  bv:  set  to  singular:  when  the  attachment  is  to  hr  seem],  the  number 
is  set  to  plural. 


Inflection  marker/preposition 

To  may  be  locally  ambiguous  as  a  preposition  or  an  infection  marker,  but  ihe  ambiguity 
need  not  cause  difficulty  (Gib.son,  1991): 

(206;  UPA24)  (a)  I  opened  the  letter  to  Mary. 

( b)  I  opened  the  letter  to  impress  Mary. 

Figure  6.20  shows  how  NL-Soar  repairs  from  the  preposition  reading  to  the  inflection 
reading,  [pp  To]  initially  attaches  as  the  complement  of  [,v'  letter].  When  impress  arrives. 
It  projects  to  VP  and  attaches  as  the  complement  of  [;p  to].  This  triggers  the  snip  of  ]pp  ro], 
and  the  initial  clause  adjoins  to  hp  to  impress  Maty],  completing  the  repair. 

6.2.5  Filler-gap  ambiguities 

Filler-gap  sentences  provide  another  interesting  test  of  NL-Soar's  repair  mechanism,  be¬ 
cause  the  location  of  the  gap  is  often  not  known  with  certainty.  Consider  (207): 

(207;  UI’Al'S)  (a)  John  found  (he  hall,  that  the  hoy  hit  t,. 

(b)  John  found  the  ball,  (hat  the  boy  hit  the  window  with  l,. 

In  (2()7a),  the  trare  appears  m  the  complemc.al  position  of  the  verb  {the  ho\  hit  the  hall), 
while  in  ( 2(  )7b),  I  lie  senlence  continues  so  I  ha'  the  trace  appears  as  the  ^ifiject  of  a  preposition 
( the  ho\  hit  the  wiruhne  with  the  hall\  Neiifier  sentence  causes  dif^lcult^■.  Figure  6.2  i  shows 
lu)\s  Ihe  lep.uf  of  the  object  liace  is  triggered.  First,  |  yy  the  windaw  j  arrives  ant'  atlaclies 
as  the  compfemenl  of  jv-  hit)  I  his  cic.ites  the  lamihai  Ita  ai  iik onsisIencA  :  two  node,^ 
(Ihe  iiace,  and  the  NP)  eeeujyving  the  same  strucUiiai  position,  it  is  inelexani  that  one  u! 
the  noiies  is  .i  phoiiojogiealK  null  iiace.  A  snip  is  generated  to  iviiiove  the  trace  U'hen 
l/V’  u.';7/|  .idjoi.ns  to  /;!,’!  .5  lies',  tiaee  is  gcneiateu  as  tiie  objer  i  ot  the  j>ienr.>sition. 

.\  slii.king  i.'spei  t  ot  iIk  so  liifer  ga.p  ainlngunies  is  ifni'  niav  l>e  jnopa.gated  oxer 

!' an  iii >i.iiu'es 

1  .’( )S,  I.  i'  s  .  I I  W  fio  do  \  on  beliex  c  ’ 

i.lo  \V  lio ‘.to  xon  iH-liew  I'm  siispeeis  lA'h  know  s  I'.i!  liales  ' 
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FiguB£  6.20:  Repairing  an  inflection  marker/preposition  ambiguity. 
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Prcdicfions  on  the  GP/UPA  colk'ction 


An  object  trace  is  [xrsited  after  each  verb  in  (208b).  As  an  NP  arrives  to  fill  the  object  slot, 
the  ti'acc  is  snipped,  and  a  new  one  generated  at  tiie  next  verb,  'rhus,  (208b)  involves  three 
separate  repair  operations. 


6.2.6  Smal.1  c1jiuss:,s,  coo.rd illation  and  othes-  miscellany 

Small  clauses 


Small  Claeses  are  subject/prcdicate  constructions  'hat  do  not  invoh  e  the  full  IP/VP  phrase 
structure.  Small  clause,s  may  consist  of  just  a  VP,  with  the  subject  of  the  small  clause  in 
.spec- VP,  Pritchett.  (i9P2)  presents  the  following  unproblematic  ambiguity  involving  a  VP 
.small  clause; 


(?.09;  Uf'A27)  (a)  i  saw  her  duck  fly  away. 

(b)  I  .saw  her  duck  into  an  alleyway. 

The  are  actually  several  repairs  involveii  in  (209).  First  is  the  reanalysis  from  1  vp  saw  |jvp  her]  j 
to  \vr  saw  (A/r  her  duck]],  which  is  ,uj  instance  of  object/specifier  ambiguity  di.scu.ssed  in 
§6.2.1 .  If  the  sentence  then  continue, s  as  in  (209a), //y  projects  to  a  VP  and  attaches  as  the 
complement  of  [v  ,vaw],  triggering  the  snip  of  [np  her  duck].  [a7>  her  duck]  then  attaches  in 
spec-VP  position  forming  the  small  clause  (ve  [v,"  her  duck]  fly]. 

Of  course  duck  is  categorially  ambiguous;  it  may  be  a  noun  or  verb,  Figure  6.22 
shows  what  happens  if  the  .sentence  continues  as  in  (209b),  The  PP  {pf>  into]  attaches  as  the 
complement  of  [v'  duckf.  This  triggers  the  snips  to  detach  [np  duck] — (.:)ne  snip  to  remove 
[atp  her]  from  (wr'  dimk],  and  another  to  remove  [at/^  duck]  from  U/<  .wivvl.  Next,  (jv/'  her] 
attaches  as  specifier  of  [»//>  duck],  forming  the  small  clause  [vp  [ve  her]  [v<  duck  [/'p  mrojlj. 
Finally,  the  small  clause  attaches  to  h/>  a.s  the  complement,  and  the  repair  is  complcfc. 
'This  sentence  provides  a  demonstration  of  how  NL-Soar  can  handle  multiple  kinds  of 
struclurai  and  lexical  afubiguity  within  one  sentence. 


Ciiordinadon 


( "otirdii'ialK;n  has  recciv'cd  little  empirical  aitentson  in  p.sycholingrustic.s  (ii’.uugii  there  is  a 
fair  aniourU  ol  vvo.'^k  i\  compLitaiiorr.d  luigui.stic:^,  e.g..  ( Ko.sy,  I  ).  and  l!,ierf  Is  potenliallv 
,1  I'liige  la.'U-'C  Cl;' inlercsting  garJeii  path  effects  wailing  to  i?e  (iiscovcied.  t  iowever,  for  the 
(Hc-sent  piu'i'Hises,  vve  shall  address  isisi  ihe  most  lia.sic  find  ol  iv, analysis  reqiintai  by 
C'ooid:  llC'C  :a/  ilv'tutvs: 

I I  n  I  !  {;ii  I  vsent  to  the  mall 

(b)  !  '.vein  lo  tin:  m;i!)  and  flue  bov.lvsions 

'iiii',  ciia'iire  I'  i,m1K  i'|iiiKii!\  ,.iiiil'i!'iu)ii,.  ,\a,r  haii.'  jr/i'i/i'i  U'  .nnj  [in 

,  i  .!!■', -.1  i  inil  I  i,;'  u.  ■  I  he  soinew  !kU  o.K!  in  1  a,.  a/ni'iS.iht'  hrr  .Put  lit  i..ur  \  n  ,!•  , 
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Figure  6.22:  Repairing  a  VP  small  clause  ambiguity. 


Figure  6.23  shows  how  Nc.-Soa'-  would  handle  the  repair'*^.  Initially,  h^p  The  mall]  at¬ 
taches  as  the  complement  of  [/»  to].  And  arrives  and  projects  its  own  maximal  projection. 
Since  a  conjoined  phrase  takes  on  the  syntactic  features  of  its  conjuncts.  the  node  pro¬ 
jected  by  and  does  not  yet  h  s/e  category  information;  this  is  represented  in  the  figure  by 
denoting  tlie  node  as  XP.  Next  j.vp  attaches  in  complement  position,  triggering  a  .snip 
operator  to  remove  |,v/i  the  maH],  whtch  can  then  attach  as  the  first  conjunct  of  (,,■/.  and], 
Ficaily,  the  h()()k.sfon-]  becomes  the  .second  conjunct,  forming  the  conjoined  phrase 
Iw  l.v/-  the  mall\  and  |,\7'  the 


Mullipli'  compounding 

Pnichctt  { 1992)  poi.its  out  lhat  nouns  may  be  comiKumdcd  multiple  limes  with^.uit  causing 
diriicultv: 

1  i’ !  1 ,  I  U'A;’'*')  (a)  VVe  .idinue  tlieir  intelligence. 

si’)  \\c  adnuie  then  intelligence  agenev  p'’lii.y  decisions 

( 'oiii[.ioundi(n!  m.iv  ‘v  analysed  as  ;id|.a'Ctioii  to  the  he.u!  o,  an  NP,  ( 'ndei  dns  .nmlysis.  the 
u  p;uf  tioui  n  susifde  N  lo  a  vaunpiunid  N  siruplv  un (aive-,  ilii’'  standard  ad|uiii.  tton  opei aio:, 

I  iif  .f  'i.l|  ii.iif  si!  !K  U,  I :  ti:.'  r  no!  \  I  I  bfir'i  iin['lenif/iioM  u:  I.i’c  slcnu 
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Figure  6.23;  A  repair  involving  coordination. 
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F'lGliRE  6.24:  Multiple  coinpounding. 


whicli  crcatc'.s  the  additional  required  node.  Fhu.s,  inullipic  coinpounding  consists  of  a 
sticani  of  ..d junctions,  as  in  Figure  6.24.  No  sain  openilors  are  n-'inired. 

Scrnanlic  v.v.  strucinrn!  amhii^uity 

finis  far  al!  the  sn in  'iires  vve  have  considered  uivolved  local  svntai.'tic  ainlnguitv  tliat  i.s 
resolved  later  in  the  sentence  by  additional  syntactic  iiifoi nialion,  .Some  finds  of  ksca! 
ambn’Mitv  thitl  ..re  lesob  ed  svntactically  are  purely  semainu;  ui  natuie,  ami  vhcrcloie  dia  m.ii 
retjuifi'  repair  of  the  iitleiance  inodel.  ('onsider  the  examples  below. 

clsXKi)  tai  I  gave  the  dog  loMa>v. 

(b)  !  ).%!%;.  tile  dog.  V'.me  boncx 
i.').v  l’P.-\ui  lai  .lohn  (SLkeil  the  bo\  foi  hss  team. 

(f!)  .loha  picked  the  I'oy  up  ai  :a;fjoo!  ycNtcukiN 


Chapter  6.  Garden  Path  Effects  .and  IJnprohleirnuic  Ambiguities 


1.  Recoveraoiiity 

2.  Bidirectionality 

3.  Independence  of  length 

4.  Distance-to-disambigualion  effects 

5.  Independence  of  lexical  ambiguity 

6.  independence  of  semantic  content 


Figure  6.25;  General  garden  path  phenomena  (from  Chapter  2.) 

In  (2 1 2),  [f^p  the  Jog]  play.s  a  different  semantic  role  in  the  sentence  depending  on  the 
syntactic  structure  of  the  second  complement  of  give  (whether  or  not  is  a  PP).  Yet,  regardless 
of  the  outcome,  [,vr>  the  dog]  remains  in  tlie  same  structural  position,  so  no  repair  of  the 
utterance  model  is  required  (the  situation  model  may  be  repaired  as  described  in  §3.4.3). 
Similarly,  in  (213),  the  meaning  of  picked  changes  with  the  arrival  of  the  particle  up,  but 
the  structural  configuration  remains  the  same. 

Sometimes  local  syntactic  ambiguity  is  resolved  by  later  semantic  content.  In  general, 
such  conditions  can  give  rise  to  semantic  garden  paths  (§2.4),  for  example: 

(214)  British  left  'waffles  on  Falklands. 

The  basic  NL-Soar  account  of  such  effects  is  .straightforward;  repairing  the  structure  on¬ 
line  requires  the  recognitionai  generation  of  the  appropriate  snips  to  effect  the  repair.  The 
rather  specific  semantic  contexts  of  these  semantic  garden  paths  make  it  unlikely  that  the 
appropriate  repair  sequence  will  be  available  as  a  chunked  process. 


6,3  Accounting  for  the  major  qualitative  phenomena 

Now  that  we  have  completed  the  lengthy  journey  through  the  range  of  GP/IJPA  construe 
Iiinis,  we  can  step  hack  and  consider  how  NL-Soar  accounts  for  the  six  major  qualitative 
phenomena  surrounding  garden  path  effects,  summarized  in  Figure  6.25. 

Recoverabilitv  People  can  eventually  recovery  from  garden  paths  through  deliberate  re- 
coinprchcnsion,  perhaps  guided  by  explicit  instruction.  NL-Soar  predicts  that  garden  paths 
•ire  recoverable  because  the  knowledge  used  to  resolve  ambiguities  is  not  architecturally 
lixeii  n:  advance  We  sau  in  Chafriei  4  how  NL  Soar  is  able  to  deliberalelv  recoiripretieiui 
iuiinnsnc  input  to  explore  alternative  patfis  at  ambiguities  Tins  capability  is  lust  what  is 
leijinicd  to  lecovei  fiom  garden  jialh  etfects  {in  fact,  the  exainjile  in  (fiapiei  4  invohtai  die 
classic  inam  xeiii/ieduced  relative  garden  path). 

Itidio'i  ttonaliiv.  ( ia.'tleii  palti  ettecis  iiiav  arise  even  xxtien  a  notiiKilly  unjiielened  patti 
is  taken,  and  the  fireterred  nilerpretation  tunis  out  lo  be  correct  N[.  Soar  predic  ts  tins  is 
possiMe  hei  ause  the  (  d’  elleel  is  purelv  a  tiinetion  ol  the  ahihtv  ot  the  (e[iau  tneclianisrn 
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to  transform  one  structure  into  another;  the  preferred/nonpreferred  status  of  the  structures 
is  irrelevant.  We  saw  this  in  GP4  and  GF13. 

Independence  of  length.  Since  NL-Soar’s  repair  mechanism  maps  structure  into  struc¬ 
ture,  the  length  of  the  surface  string  is  not  the  critical  factor.  NL-Soar  predicts  the  existence 
of  short  garden  path  sentences  (e.g,  GPI5  and  GP20)  as  well  as  unproblematic  ambiguities 
with  extended  distance-to-disambiguation  (e.g.,  UPA3). 

Distance-to-disambiguation  effects.  Although  length  is  not  the  important  factor,  in¬ 
creased  distance-to-disambiguation  can  lead  to  a  garden  path  effect.  In  NL-Soar,  this 
happens  in  just  those  cases  where  the  structural  characteristics  of  the  intervening  material  is 
such  that  it  causes  loss  of  the  critical  relations  from  the  A/R  set.  We  saw  this  in  the  contrast 
between  GP2,  UPAl,  and  UPA3.  Thus,  NL-Soar  predicts  that  there  are  not  pure  distance 
effects,  but  structurally  modulated  distance  effects  arising  from  syntactic  interference. 

Independence  of  lexical  ambiguity.  Because  NL-Soar’s  repair  maps  structure  to  struc¬ 
ture,  lexical  ambiguity  is  only  relevant  to  the  extent  that  it  has  structural  ramifications. 
NL-Soar  predicts  that  lexical  ambigu'ty  is  neither  necessary  nor  sufficient  for  causing  gar¬ 
den  path  effects.  This  general  prediction  can  be  seen  clearly  from  the  results  on  the  many 
examples  in  §6.2.4  involving  lexical  ambiguity. 

Independence  of  semantic  content.  Semantic  ambiguity  need  not  cause  a  garden  path 
effect.  We  saw  in  UPA30  and  UPA31  that  the  utterance  model  need  not  always  be  repaired 
in  the  case  of  semantic  ambiguity.  Fu.rthermore,  any  required  situation  model  repair  can 
be  accomplished  directly  by  the  construction  processes  that  keep  it  in  correspondence  with 
the  utterance  model  (§3.4.3). 


6.4  Summary  and  general  discussion 

Table  6.1  summarizes  the  predictions  on  the  GP/IJPA  collection,  rhe  results  are  good: 
the  theory  accounts  for  52  out  of  57  constructions  (9)91))  Only  the  theories  of  Pritchett 
(1992)  and  Gibson  (199!)  have  comparable  coverage,  an  .l  no  other  architectural  theory 
competes,  hurihermore,  no  other  existing  theory  accounts  foi  all  six  qualitative  phenomena 
(lor  example,  NL-Soar  oilers  the  lust  explicit  model  of  how  people  might  actually  recover 
from  garden  path  effects).  Since  N1 ,  Soar  embodies  the  Structural  (Jarderi  Path  Hypothesis, 
the  good  results  ol  the  theory  oiler  iurther  support  for  tins  hypothesis,  along  with  Pritchett 
(  1992)  and  Gilr.on  (  1991 ) 

I’eiiuips  the  biggest  theoretical  issiii-  lat ms,'  the  model  is  the  origm  oi  the  repair  met  h 
aiusiii.  What  gives  use  to  tlie  paiticulai  locaiitv  consliaint  embodied  m  the  geiieralui  loi 
the  snip  o[K-ratoi  '  Altliough  the  coii'-Iraiiit  ;s  simple  enough,  aiul  the  inesence  ol  some 
coiistmml  Is  well  iiioliv, ued  l  om|)alatii.inal!\ .  the  |)ri'<,  ise  toi  m  ol  the  eoiisiiaiiii  t ould  [ios 
Mii|\  be  .il’.eied  shghlh  and  still  eiisiiie  etlieient  uqsin,  (.A  tomi  ol  this  qiieslion  i,  ,in  be 
pin  to  ne,uls  e\er\  othei  ,stn:eiiiial  g'aideii  [Kith  theoi\  I  It  is  mteiesim!'  to  note,  ho\o‘\'ei. 
III. It  til'.'  uleiili Ik  .it ion  ot  the  maximal  |'io|ei  lioii  ,is  ihe  i  cmsti  amui'c  lor  ahis  lor  sni[>  mi.'ans 
that  i’le  theiuN'  conKl  be  m.i['pe(l  riitlua  smipK  onto  ladu  .ills  ditteienl  stiiictmes,  -.ueh  ,is 
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Table  6. 1 ;  Summary  of  predictions  on  the  UPA/GP  collection. 


Objectysubject/specifier 


UPAl  • 
UPA2  O 
UPA3  ® 
IJPA4  # 
UPAS  • 
UPA6  • 
UPA7  • 


GPl  • 
GP2  • 
GP3  • 
GP4  • 
GPS  • 
GP6  • 
GP7  • 
GPS  • 
GP9  • 
GPiO  • 


Complement/adjunct 


Main  verb/reduced  re.lative 


Lexical  ambiguities 


Filler-gap 


Small  clauses,  elc. 


UPAS  • 
UPA9  * 
UPAIO  O 
UPA! 1  • 


UPA12  • 
UPAl 3  O 
UPA14  • 


UPA15  • 
UPA 16  • 

UPA17  • 
UPA18  • 
UPA19  • 
UPA20  • 
UPA2i  • 
UPA22  • 
UPA23  • 
UFA24  9 
UI^2S  ' 
UPA2b  • 
UPA27 
I IPA2H  • 
UPA29  • 
U(‘A3(.'  • 

UPA  31  ♦ 


GPU  O 
GPl  2  O 
GPl  3  • 


GPl  4  • 

GP15  • 
GP16  • 
GPL?  » 

GPl  8  « 
GP19  • 
GP20  • 
GP21  ® 
GP22  • 
GP23  • 
GP24  • 
GP25  • 
GP26  • 


•  correct  predn  non 
(-’  ■  incorrect  predictiori 


().'}.  Suinrnaty  and  ycncn!  discussion 
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(iepeiuicncy  trees.  In  a  ricpendcncy  grammar,  maximal  projections  corresirond  directly  to 
single  lexical  nodes. 

Of  course,  there  arc  the  mis.sed  predictions  to  be  accounted  I'or.  Three  of  the  five  incorrect 
predictions  involve  complement/adjunct  ambiguities,  v/hich  may  help  focus  re, search  for 
ways  to  improve  the  theory. 

!n  summary,  although  there  arc  still  important  issues  to  deal  with,  these  results  help 
establish  the  viability  of  the  broader  class  of  single  path/simple  repair  models.  By  virtue 
of  being  embedded  in  the  larger  architecturally-grounded  theory,  NL-Soar  also  explains  all 
the  major  qualitative  garden  path  phenomena,  and  opens  the  door  to  an  understanding  of 
how  learning  modulates  the  recovery  skills  of  adult  comprehension. 


Chapter  6.  Garden  Path  Effects  and  Unproblematic  Ambiguities 
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Chapter  7 


Immediacy  of  Interpretation  and  the 
Time  Course  of  Comprehension 

ONE  OF  THE  PRIMARY  CONSTRAINTS  ON  NL-Soar  is  that  it  accomplish  the  func¬ 
tions  of  comprehension  in  an  incremental,  real-time  fashion.  As  we  saw  in 
Chapter  2,  immediacy  of  interpretation  is  a  fundamental  principle  characterizing 
human  sentence  processing.  Although  nearly  all  comprehension  models  embody  some  kind 
of  incrementaiity,  most  do  not  make  predictions  about  the  hme  course  of  comprehension. 
T  he  READER  model  (Thibadeau,  Just,  &  Carpentet,  1982)  is  a  notable  exception.  Because 
NL-  Soar  is  grounded  in  a  cognitive  architecture  with  an  independently  developed  temporal 
mapping  (§3. 1  2),  we  can  u,sc  NL-Soar  to  make  chronometric  predictions,  The  first  section 
of  this  chapter  examines  why  NL-Soar  is  an  immediate  interpreter.  Next,  we  consider  how 
NL-Soar  satisfies  the  real-time  constraint,  by  analysing  the  structure  of  the  model  as  well  as 
actual  system  behavior.  I'inally,  the  model  is  used  to  make  a  number  of  specific  predictions 
(both  (.jualitative  and  quantita'.ive)  about  the  time  course  of  comprehension. 


7.1  NL-soar  as  an  immediate  interpreter 

in  geiuTal,  huni.ui  compr-ehension  orocesses  operate  immediately  at  ah  levels  svntactR', 
semantic,  .iiid  lefeicntial  is'2.2).  T  here  is  noevideitce  ol  sysicni.uic  del.avs  at  any  ie  .ei  .As 
soon  as  the  lelevanl  mi'iils  are  piesent.  the  comprehension  [m ocis-.scs  applv,  mctemeiUall\ 
tnnidmp  lepresentations  m  xnrkmg  memory, 

fins  IS  an  adecjiiali-  cliaractcn/atioii  ot  N1 .  ,Soai  as  \sel!  In  l.icl,  miiiiediac',  is  the  most 
iialni  ,il  pi  occssiny  sii  aiepN  loi  \'!  .So.n  It  is  the  naitne  >sl  the  iesoeiiiuon  memoiv  in  .So.u 
that  assocnitions  Ine  .uilom. tin  ally  wiieiievei  tlieii  reles.iin  ron.liUons  are  j'lesent  I  heo' 
IN  no  I'enn.d  exes  iiii'.e  schednlms'  the  jiioeesse.  i  bis  ix  .i  tsisiv  pioiieiiv  o|  piodiK  tu>n 
s\\teiiis  in  yene'.ii  (  Nev.idl,  I'.i.Mai,  and  .il.-o  .icci'imi.'-  A>(  flu'  inmiedi.ilc  [noci.  ssiiii,'  m 
in  \l  li  I'  (  liisl  A  t  '.iipenU'i,  i'OS  ,'  ', 

III  side  i  tio\\  s\  1  ilaci  u:  numediaev  .u  I'-cs  m  N !  .'soar,  I  lik'  s\  niact  ic  pmh  ex-.is  .u  >  tlir  sei 
ol  ,i\ .iii.iblr-  !i  s  oiisti  netoi  s  Vviienevei  llie  cotuiit'Oiis  o!  a  p.ii  l  icii  l.u  ii  lonsiiustoi  pioi'ox.il 
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associalion  arc  salislicd  (§3.3,4),  lha(  a.ssociatioii  vvil!  lire  proposing  the  ii -constructor. 
Unless  then.'  is  otlier  cognitive  processing  that  requires  immediate  attention  (i.e.,  other 
cognitive  ofieratoi  s  are  given  better  preferences  by  recogjiitionally  available  search  control), 
the  u-coristructor  will  apply  at  the  next  decision  cycle,  that  is,  within  the  next  .50  milliseconds 
r)r  so.  This  immediacy  hi.ilds  tor  semantic  interpretation  (s-constructors)  and  reference 
resolution  (resolve  operators)  a,s  well. 

Of  course,  o.ily  what  is  immediately  available  in  the  recognition  memory  (learned 
chunks)  can  be  immediately  applied,  so  NL-,Soar  predicts  that  there  are  limits  to  immediacy, 
and  these  limits  are  modulated  by  experience.  The  basic  prediction  is  that  the  more  novel 
and  specific  a  particular  aspect  of  the  comprehension  process,  the  more  likely  that  automatic 
comprehension  processes  will  fail,  requiring  more  deliberate  comprehension. 

Consider  what  this  means  with  respect  to  the  three  basic  processes  in  NL-Soar  (syn¬ 
tactic,  .semantic,  and  referential).  We  should  expect  that  syntactic  proc.ssing  will  proceed 
automatically  most  of  the  time  for  adults,  since  the  chunked  u-constructors  are  applicable 
to  an  unbounded  variety  of  sentences.  (Of  course,  this  is  modulo  the  range  of  failures 
discussed  extensively  in  Chapters  4,  5  and  6.)  Similarly,  since  the  s-constructors  build  up 
reference-independent  .semantic  representations  based  on  the  meanings  of  words,  they  too 
should  applied  recognitionally  much  of  the  time,  though  they  may  fail  more  often  than 
the  u-constnictors.  The  referential  processing  is  a  different  matter  altogether.  Even  in  a 
straightforward  di.scourse,  the  referential  level  may  be  a  source  of  novelty.  In  NL-Soar. 
this  is  also  the  level  ol  processing  that  integrates  the  content  into  long-term  memory.  Some 
referential  processing  can  be  automatic,  but  if  there  is  any  new  content  in  the  di,scoursc, 
then  impasses  must  arise  at  some  point. 

It  is  (liereforc  not  surprising  that  the  limits  of  iinmcdiacy  of  interpretation  have  been 
found  primarily  at  the  referential  level  (§2.2.3).  The  autoniatic/deliberate  distinctions  found 
in  these  studies  rmifis  well  onio  (he  struclure  of  NL  Soar.  However,  NL  Soar  docs  suggest 
lliat  attempts  to  tind  purely  static  criteria  for  (.ieterminirig  which  aspects  of  comprehension 
are  aiilomatie  are  ulhmalelv  doomed  lo  failure.  Instead,  Ilie  picture  llial  emerges  Irom  NI. 
Soar  IS  oiu  in  which  (he  autoinalit  /dehivrate  ik.stmcliun  is  a  limclion  (lornain  knowledge 
.iiid  pnoi  cxfX'iR'iue  u  ilti  similai  l.mguage. 


7.2  Satisfyinj^  (he  real-tiine  constraint 

I  >oi*s  Nl  Soai  111  I.r  !  s.iiisls  (lie  leal  lime  eoiisti.imt,  eompietieiuim!'  .n  an  awi.ige  lale  tg 
.’■'il  ni'.  [lei  Wind  ’  111  iliis  seeiion  ue  l.ike  l\\i>  ap[i!o,khe\  lo  .inswi-img  this  qiieslion 
,i!i.i!  \  SI  ne  [lie  si i  in  uii e  ol  die  m> ulel,  .iiul  ,m.il\  sing  ilk  .k  lual  l'eh.i\  loi  ot  idn;  i node! . 

I  -- 1 1  le  some  1 1  'll  eh  ..-si  III  i.tle-,  ot  model  heha\  n  n,  il  is  [>ossilile  lo  dele!  in  i  ne  1 1  N 1  .Se.n  is 
w  I !  i  Ii  II  .1  I'l.ii  !si  hie  I  .I'liee  !  n  s!  eonsidei  !!o\',  inneii  I  line  is  speiil  on  ieeoeniiioii.il  liiiei  .iik  e 
I n  hU'I  .iiiii  SI m.il  lo: M'loi le!  1.  oils; !  uel Ion  !  .iv  h  \v of d  s\  i II  e  ■.  i>ke  .it'onl  I  u  eoiisi 1 1 u  loi  s  .ind 
I  1  Oh  •,(  1 1  11  e  'I  s  I  ,ik  1  !i"  1  eipej  .iloi  s  ,is  (he  ,i  \  ei  .ig-e  loi. if  .11  id  e( )  1 1  is  pei  i  os'  i  nl  i  \  e  i  g  'ei  ,iioi 
,is  ihe  ,iu  h  !U\  1 1 1 1  ,il  .s  'I  ili.ii  means  .iln  nil  I  ‘^o  ms  pel  w  oi  d  i speni  on  i  eeogm  ( ion.!  1 

iiiovie!  I  1  Ills!  Ilk!  lon  l-Miu'  '.peiil  I'M  die  lesobe  ‘''I'ci  .i|i  e.  s  is  mole  diilienll  !o  e-iim.ike 
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because  the  amount  of  processing  evoked  by  different  words  varies  considerably.  Suppose 
that  each  word  evokes  on  average  one  resolve  operator.  This  brings  the  total  to  200  ms 
recognitional  processing  per  word.  Thus,  this  analysis  suggests  that  the  model  passes  the 
hrst  critical  test:  the  predicted  recognitional  time  per  word  is  less  than  the  average  time 
per  word  for  human  comprehenders.  As  discussed  above,  there  must  be  suiu>.  time  left 
over  for  more  deliberation.  However,  the  analysis  also  shows  that  the  fit  is  tight,  leaving 
only  an  average  of  50  ms  per  word  for  deliberate  processing.  Put  another  v'ay,  the  analysis 
indicates  that  comprehension  is  about  80%  recognition  and  2.0%  deliberation.  Real  time 
comprehension  clearly  presses  the  architectural  limits. 

To  establish  with  more  confidence  that  NL-Soar  meets  the  real-time  constraint,  we  must 
determine  several  factors  empirically: 

•  How  often  u-constructors  and  s-constructors  impasse  (the  transfei  rate  of  u-constructor/s- 
constructor  chunks) 

•  How  many  s-constructors  and  u-constructors  apply  per  word,  on  average 

•  How  often  resolve  operators  impasse  (the  transfer  rate  of  reference  resolution  chunks) 

•  flow  many  resolve  ope.'ators  apply  per  word,  on  average 

As  argued  above,  the  transfer  rate  of  u-constructors  and  s-constructors  should  be  good, 
since  they  constitute  fairly  general  processes.  There  is  some  empirical  evidence  to  hack 
this  up.  I'lgure  7. 1  shows  a  graph  depicting  a  learning  curve  on  a  corpus  of  51  sentences 
(ilovised  to  test  the  syntactic  range  of  the  system).  The  hori/ontal  axis  represents  the 
cumulative  number  of  words  comprehended;  the  vertical  axis  represents  the  perceiitage  ol 
words  comprehended  without  iinpas.se  (averaged  over  a  moving;  24- word  window).  I'he 
data  is  iroin  an  eaiiiei  version  of  Nl.-Soar  (l.etiman,  l.evvis,  A  Newell,  1991;  Steiei,  Lewis, 
l.ehniaii,  A  Zacherl  Pt'lt)  which  combined  utleoinee  .uui  situation  model  i'Uildmg  into 
one  operator.  I  lie  svMein  ilid  not  [leiloim  leieienee  lesoliition.  fhe  d;ita  are  still  relevant 
beeaiisf  ihi-  li.msler  iii  the  pussent  s\  slem  sfiould  hi'  .it  least  as  good  as  the  older  s\  stem 
(see  s’.L/  tor  ,1  discussion  ot  the  (i.inster  prtijierlies  ot  dilleiiiit  compielieiisioii  opeiatni 
schemes ). 

file  S',  stem  starts  w  ithnut  an\'  chunks  (li.u  lUieclh,  iiupiiemeiit  l  oinpietieiision  opeisitoi  s 
(lluiii'’h  the  er.iph  starts  above  /eio  iH'c.iiise  then'  was  some  i  hunk  tiaiislet  vv  iihin  the  tiis: 

.’4  vvcadsl.  W  ithout  le.iiiniig  the  giapli  would  he  a  dal  line  at  t)'  ,  !  he  icciM’iiilion.i! 

late  ol  SO  ')()',  .iclneved  iie.ii  the  end  ot  the  loipns  is  eiitiiidv  a  lesiil!  ol  ihr  tiaiisici  ol 
Icaiiicd  chunks.  Ii  seems  fifnisihle  (o  ,.issuir,e  tli.it  in  .idul!  coinpi eheiision  the  ti.iiislei  i.ilc 
I  o!  Ml  lei  ,m.  c  I  nodi  I  .aid  sitii.iiioii  nio  ii  I  coin  ( i  ;u  i  u  ai  e.  ,:v  en  h  i  e  hei 

Ihed.il.Mii  lieiiie  '  I  sidi  le.iv  es  vv  lull  is  poteiilMi  Iv  llii  inosi  sumuIu  .ni!  l.icU'i  un.id 
■- 1!  e  ssed  dll'  .1  u  loi  I  111  o!  1 1 1  III'  u'ljii  u  I'll  lo  pei  n  a  i  u  i  e  t  la  I'lii  ui !  pi  oc  cm  ne  !  \  ei  c  n!  i  \  i  on  ipic  i  ei  i 
'V  o;  k  hv  I  111  1 1  1 1  i.iil  ,  ! ' '  o  pi  o\  ides  soiMi-  i  .lit  If  I  1 1 1  le  1  i‘sl  111 .ill  1 1  1  e  le  \  ,  a  1 1  d . ! ! .  i  1  h .  il  he ins  |i  i 
.niswei  lliis  i|iie.!ioii  llnliin.iii  inv  est  n'.ite.  1  the  luiiuie  oi  nisti  iic.il'ii'  inleliieenl  .cenis 
w  II 1 1  I  i;sl !  IK  !o  No.  11.  ,1  s  \  s  li-iii  i  li.ii  Kike',  'i.il 111  ,ii  l.iui'i i.C'e  M  o,!  i  lie  1 1 oi  i ■,  io  .  oi ;!  i  ol  ,■  '-.m  m  l.iU'd 
lohoi  III  .'  I'loek'-  woild  dcairuii  Inslinelo  No,  i  hke  \!  No, a  i  .  hmh  v.iihm  ihe  N.-.n 
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Words  comprehended 

FU’iURr  7. 1 :  I'Micicncy  inca-asc  iliic  tucliiink  iratisfci  with  an  ol'lcr  version  of  NL  Soar,  The  graph 
shows  the  percentage  of  ssonls  comprehended  without  impasse,  computed  ov  er  a  moving  24  word 
w  iridow  WitfKHit  learning.,  the  graph  would  t>e  a  Hat  line  at  ()*:; 

arctutecliirc.  I'he  natural  language  component  of  Instructo  .So  u  is  a  version  of  N1  Soar 
vorv  close  to  the  one  lucsenleii  in  this  thesis  dheonlv  signiheaiit  dillerencc  is  the  nature  of 
the  svniaetic  lejiieseiuatioii i. 

Iiistiucio  .Soa,  was  niii  ihiough  .i  setjuence  o1  tasks  th.it  involved  the  cinnptelieiiMon  I't 
'<4  utter. iiK'cs,  coiii|irised  ol  44  f  wuids  1  he  uttei.iiices  consisted  cd  simple  insf  Uc  ticms  and 
e\[ilanalions  such  as: 

( .’ I  a  )  (.0  I'lc  k  U|i  the  vclli'w  hlock 

dci  Move'  to  the  giev  t.ihU'. 
ic  1  I  he  oper.stoi  is  timsticil 

\  tol.ti  ol  a(,(i  Li  coiisli  uctiii  s.  -iiti  s  ciinstiiK'tois,  ,iiid  (' Ol  ic’M'lvc'  cn.ei.iiois  weie 
ag|  li  leaf  1  he  i  lui  iif'ci  1 1|  s  ;  ensn  iu  loi  s  is  less  ih.m  .he  nun  ihci  ci!  wools  hci  ,iusc  i  n.iiu  \i  ou  is 
ii  c  ,  luiKtion  wolds)  did  Mol  evc'kc  s  coi'sti  licic'i  s  I'lif  iot.il  inmihi.'i  o.i  c  LMiipichi'iish  ni 
oj  1 'I  ,iioi  s  I  s  a(,t<  •  i  I  (i  t  (i  a,  !  ()  1  ,X  L )|  i.n  .n  er.u’c' ol  !  ,'S  pci  woi  d 

(  )t  I  Is  -  hi  I  I's,  4  \  e  L  ijH'i  ,i|oi  s,  ’ !  (>  Ic'd  to  I  inp.issc's  1  hi-u-  w  eu'  '  decisu  'U  c  \  c  Ics  spent 

loiisolxi  llu’se  iMipasses  loi  .,u  i  .1  v  ei  are  o!  4  hes  i  sii  ii  i  i  \  c  ies  iir  i  nap.isse  llio  is]iis( 
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493  words  (94  utterances) 

1,618  total  comprehension  operators 
3.28  comprehension  operators  per  word 

216  impasses  on  resolve  operators 

953  decision  cycles  Oii  :(:.ive  operator  impasses 

Total  decision  -  i.otr  ^53  =  2,57 S 
5.22  decision  cycles  per  word 

Average  time  per  word  5.22  x  50  rns  =  261  ms 
Comprehension  rate  =  1000  ~  261  x  60  =  --23G  v,rords  per  minute 


Figure  7.2;  Results  from  ninning  NL-SoaT  in  Inslmcto- Soar’s  doiTiain.  The  50  ms  per  operator 
architectural  constatil  permits  approximate  zero-pruumeter  predictions  of  comprehension  rate. 


enough  time  to  learn  to  recognize  one  or  two  new  aspects  of  the  situation  model  (§3.5.1). 
Assuming  that  the  u-constructors  and  s-constructors  were  fully  chunked  (they  were  not, 
ofcour.se — Instructo-Soar  started  with  no  language  chunks  of  any  kind),  the  total  decision 
cycles  spent  was  1 . 618  -v953  =  2, 571,  or  an  average  of  5.22  per  word.  At  50  ms  per  cycle, 
that  means  an  aveiage  of  26!  ms  per  word,  or  about  230  words  per  minute.  These  results 
are  summarized  in  Figure  7.2. 

This  comprehension  rate  is  remarkably  close  to  the  observed  rate  of  skilled  human 
readers  (~240  wpm;  (Ju.st  &  Carpenter,  1987)).  Although  this  it;  just  one  data  point,  the 
test  is  a  significant  one  because  it  embeds  NL-Soar  in  a  functional  task  situation.  Reference 
resolution  is  required  for  both  the  immediate  comprehension  of  the  text,  a.s  well  as  for 
producing  a  long-term  memory  of  the  instructions  to  be  used  in  later  behavior. 

if  NL-S(.;ar  appears  a  bit  ;,!ovv,  it  is  important  to  reaii/e  that  on  Instructo-Soar's  ta.sk,  NL,- 
Soar  is  taking  the  time  required  to  produce  a  complete  recrvgnition  nsemorv  ol  the  (.ontent 
^>f  the  utterances  (though  ihc  memory  must  be  retrieved  via  a  recon.siruclive  process  u  hich 
is  not  guarantcctl  to  succeed;  see  §  h  should  be  possible  io  speed  uj.i  NL  Si,).ai  ..ti  itu.- 

cos!  of  producing  a  shallower  comprehension  (Ju.st  &  (  'arixaitei,  1987).  Neverdieless,  dic 
luimbers  clearly  indicate  that  NL  Soat  is  e-perating  very  close  to  Ihc  limits  o,  tjit'  rral  time 
conslramt. 


Am.ither  factor  that  this  analysis  does  not  lake  latvi  acconut  is  ibe  i.»vcrhead  oi  lexica! 
t'Mcodiiig  and  access  Aitluuigli  tins  is  .tssjirned  tc'  take  icace  \'ta  cncoiiiug  production  Inch 
Ciiii  n.m  111  [lar.dlei  \s  ilii  (  ■ih.-f  corni'refiensaon  OjX‘raiois  !  fi  .vi,  tlieu-  arc  stiii  utl.etc  it  dat;! 
liefrciiilcfK  ICS  v.hic!!  i ouhl  aiiiouui  c  adunioaa!  latencies  h  is  tlKarhac  eiitns'h  (lossihle 
doit  iiicoi T'oratiiu;  a  lUiirc  ts  alistii:  lexical  (.lonuionent  aiid/or  adiiituiiial  cx()cTieiu  s'  vs  ith 
o[(k  !  doll  .iins  will  leve.il  that  Nl,  Soar  is,  iri  lact,  too  slow.  In  i.ui\' s.  vt  iii,  iIk:  oo'sent  da'.t 


iiuiK.iie  ill  I’  ills.'  iic!ib(.ratc  processes 
the  real  liiiu'  Is;!!,;',  ivu 


ret  ere  lire  icsidutM.iri  need  no'  scrniusls'  laniipromise 
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7.3  Predicti«>iis  about  the  time  coarse  of  comprehension 

The  previous  section  was  about  averages— establishing  that  NL-Soar  is  capabie  of  the 
rapid  throughput  that  is  characteristic  of  skilled  comprehension.  However,  readers  do  not 
pend  the  same  amount  of  time  on  each  word  (Just  &  Carpenter,  1987),  In  this  section  we 
consider  six  predictions  NL-Soar  makes  about  relative  processing  times  as  a  function  of 
various  aspects  of  the  text. 

To  use  NL-Soar  to  make  predictions  about  fixation  durations  requires  adopting  the 
Eye-mind  Hypothesis  (Thibadeau  et  al.,  1982;  Just  &  Carpenter,  1987): 

(2 1 6)  The  Eye-mind  Hypothesis:  The  eye  remains  fixated  on  a  word  as  long  as  the 
word  is  being  processed. 

Although  the  Eye-mind  hypothesis  is  generally  well-supported,  it  is  important  to  be  explicit 
about  this  assumption,  because  without  it  no  predictions  can  be  made  about  relative  fixation 
times. 

The  first  prediction  is  that  more  time  will  be  spent  on  content  words  than  function 
vyords.  Content  words  may  evoke  all  three  operator  types  (u-constructor,  s-constructor, 
and  reference  operators),  while  function  words  tend  to  evoke  only  syntactic  structuring 
operators  (this  was  true  in  the  Instructo-Soar  corpus).  This  prediction  is  consistent  with  the 
data,  but  it  is  difficult  to  separate  the  effect  from  a  frequency  effect,  since  content  words 
are  not  as  frequent  as  function  words. 

The  second  prediction  is  that  more  time  will  be  spent  on  complex  syntactic  structures 
than  simple  syntactic  structures,  independent  of  ambiguity  and  semantic  content  The 
reason  is  that  more  complex  .structures  simply  require  more  time  to  build,  even  when  they 
are  built  with  a  single  u-constructor.  Figure  7.3  shows  why.  If  a  structure  contains  n 
links  such  that  there  are  inherent  dependencies  between  the  links  (link  .v,  cannot  be  built 
until  link  ..p.  i  has  been  established),  then  the  structure  will  be  established  with  a  ripple 
of  n  issociation  hriiigs  wdthin  the  ap[)lication  of  the  operaU'i  Assuming  that  associations 
openue  at  the  --  -  10  ms  level  (  -  3  30  rns),  a  structure  requiring  n  -i  j  associations  will 

'ake  -  lO  •  /  ms  longer  than  one  tequiiing  n  associations.  This  is  a  lower  bound  on  the 
addUioi);.'!  time;  more  complex  structures  are  also  more  likely  to  be  split  into  a  sequence 
o!  !\vo  u  t  onslruetoi s.  In  ihose  e.ises,  the  overliead  of  :in  additional  decision  cycle  will  lie 
uicunctl. 

i  t.--  thud  [iredielioii  r.  (hat  more  time  will  be  s[)ent  on  disaml/iyuatiny  reyians  when  ihe 
inconecl  miei [ueiation  has  been  pursued.  This  holds  lor  both  eaiden  path  slruciurcs  and 
uiijirohlenialie  ainluginlies  h;  garden  ['aths,  addi'.iona!  nme  will  clear  iy  be  nicurred  tieeause 
the  reeiigii'Ooii.i!  rcfcui  la.il.s,  leijiiiimg’  the  apphealion  of  some  deliherate  letoveiy  si; . 
The  uioic  interes! in;:  ease  is  tecoginlioiial  icp.air  ilsi.-lt,  I'ven  it  l!ie  pn)cer.s  is  diunke-d  mfM 
a  Miiglc-  11  e< )nsU  iictoi,  Ihr  sei.jueuee  ol  as.'-o<.-|.;Uons  imgileineiiting  die  o^Ruatoi  r  cx-c-Dued 
ailli  tiie  associations  dial  acconiji.lisii  ihe  snip  li.u  seijiicnce  o!  snij's)  atid  i eat'.K .hn icnls. 

[  1.11  the rinoi e.  [e[St a  s  i na.'i  lake  g'lace  as  a  sCLjueiiec  ( d  iwo  u  con.sli iictors:  . tni.,-  ii  t  oast i  ik  I’oi 
nu  ( a  [SOI  at  liig  tlic  iiuoniiri;..’  Miatcn.il,  and  a  sei.i>iid  u  e'oii  .ii  ikioi  ae'ea!)'.  oei  .ioniiiiy  ctie 
icjsm  and  leanadmient 
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Figure  7.3:  '('he  elaboration  phase  ol"  a  u-constnic!:or  application  lengthen.')  as  a  function  of  the 
number  of  .serial  syntactic  links. 


This  basic  qualitative  prediction  is  borne  out  in  nunerous  studies  exploring  attibiguity 
re.soiution  (§2.3).  As  an  example  of  bov/ the  theory  might  be  applied  q'’:t.ntiiative!y,  consider 
the  contrast  between  the  following: 


(217;  UPAl )  (a)  I  forgvit  that  Pain  needed  a  r<de. 

(b)  I  forgot  Fain  needed  a  ride. 

Sentence  (2 1 7b)  is  an  example  of  the  familiar  unprohiematic  subject 'object  ambiguity. 
The  prediction  is  that  the  fixation  time  on  the  disanibsguatiiig  region  net^dtui  wil!  be  longer  in 
(2 1 7b)  tiian  in  (21  7a),  because  one  adrlitioual  pioces.s  is  .ic-jusiea:  the  snip  of  i./vf  from 
(v  forgoih  At  inininiun:  tlic  u  consfi'uetc'r  will  require  an  addiborial  assf.)Cuu<on,  e.xicndirig 
the  operatoi  by  ---!()  ms.  .Sf  die  repair  is  acconiphshed  l:>v  a  separate  u-con.structor 
(which  snips  l,v/'  i'am]  then  yx-nforins  the  reattachmerit),  the  iinx'  v\  ill  be  extended  by  about 
,Sd  ms.  The  hiuer  account,  may  Ih”  correct.  .!n  an  eye  fexaoon  study  using  'natei'isi  gist 
like  (2! 7a),  Ferreira  and  Henderson  (19'X))  found  tha:  (isations  on  the  disambigiraung 
region  ;n  .scruences  without  the  overt  coniplementi/er  (2i7b)  wcie  -  7U  ms  njgfcr  th;tn 
iht  >r  unambiguous  counteipans  (.■  i7al. 

i'lic  Fourth  predict,u>n  is  t.hai  more  tune  will  lie  .syx  nt  on  afnh!i;n!HiS  rfU’ior;},  (h.,ui  uuupt 
/('guerv  ( Mac  Donald  ct  a),.  1'<‘>3  |.  Again,  tins  estia  tmu:  i.aFi  take  several  liar  iS 
loam  st  ih'-  amlngjitv  lesvitution  is  compleiciv  ctiunkrd,  die  decision  c\c)e  Ounng  whuli 
the  a hc!  nail  VC  ujiciatoi  s  aie  pioposcd  may  be  exlendcd  !.>v  a  sci acs  o!  associatioii".  ihai  ai  t  v 
oui  tin.  fiuKtmn  oi  amliieuitv  resohitsou  fl  igurc  .v^'o.  rtt(.'se  assotaa'ious  must  isve  -ilti'f 
■he  o;'tr:r;aoi  [uojuis,:!  assoenii ions,  hci  ause  their  iaindaiofi ■,  te  .s  foi  die  I'roposrd  oi'craloi  s 
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Thus,  there  is  an  inherent  serial  data  dependency  which  extends  the  decision  cycle  by  a  few 
tens  of  miiiiseconds. 

The  fifth  prediction  is  tiiat  there  is  a  fii-ed-gap  effect  (Stowe,  1986) .  A  filled- gap  effect 
occurs  when  an  overt  nounphrase  fills  a  syntactically  possible  gap  location.  Consider  the 
following  examples  from  Chapter  6: 

(218;  UPA25)  (a)  I  saw  the  ball  that  the  boy  hit  u  the  w'indow  with  yesterday. 

(b)  I  saw  the  ball  that  the  boy  hit  u  yesterday. 

As  described  in  §6.2.5,  sentences  like  (218a)  require  a  repair  at  the  NP  tke  window] — 
namely,  the  snip  of  the  posited  object  trace. 

.Alternatively,  an  impasse  may  arise  if  the  ambiguity  cannot  be  resolved  by  recognition. 
In  that  case,  the  comprehension  process  may  be  extended  by  a  few  hundred  milliseconds 
(or  even  more)  as  knowledge  in  a  lower  problem  space  is  brought  to  bear  to  evaluate  the 
alternatives. 

The  sixth  prediction,  already  discus.sed  above,  is  that  more  time  will  be  spent  on 
novel  aspects  of  the  text  than  familiar  aspects.  This  a  general  qualitative  prediction  that 
follows  from  the  nature  ot  the  referential  processes  in  NL-Soar.  Novel  aspects  will  not  be 
immediately  recognized,  giving  rise  to  impasses  that  build  up  the  long-term  memory  of  the 
di.scour.se. 

7.4  Summaj  y  arid  discussion 

Figure  7.4  summarizes  the  predictions  concerning  immediacy  and  the  time  course  of  com¬ 
prehension.  These  predictions  derive  from  the  structure  of  NL-Soar  as  well  as  analysis  of 
.system  behavior.  None  of  these  predictioiiS,  including  the  tempera!  predictions,  require 
additionr'  a.ssum  ‘ions  beyond  the  basic  model  and  the  Soar  architecture.  The  theory  not 
only  expl ns  the  fundmental  iiTitiiediacy  principle,  hut  also  accounts  for  the  observed  dis- 
nnetion  b^iv/een  automatic  and  deliberate  processes  in  comprehension.  Furtheunore,  the 
theorv  makes  jualitative  and  rjuantitative  prcdictior's  about  the  rate  u.f  comprehensior,  and 
the  '•dative  time  course  of  comprehension  as  a  function  of  certain  features  of  the  utterances. 

h  is  impcrlaut  to  realm*  that  the.se  temporal  predict irnrs  are  essentially  rera-pararneter. 
Although  the  approximate  nature  of  the  aicnilecUuai  constarhs  .means  the  predictions  are 
also  approximate,  and  (here  is  sometimes  more  than  one  possible  rcaliztitiou  ol'a  particular 
limctiot!,  there  are  <io  ilcgiees  of  fiecdoui  iii  mapping  the  system  hcha'noi  to  elap.scd 
time  In  iacl,  Nf.  .Soar  is  the  first  comprehension  mode!  to  make  '/.ero-parameter  temporal 
|)iedictioi.is,  '1  he  alulilv  to  tuakc  such  predictions  is  one  of  the  'vuuliall'  of  usiut?,  ;!  unified 
theo!  o!  tasgiiilioM  (  Newdi,  I  ttpO).  In  ifie  case  of  N! .  .Soar,  the  bir-..c  ardiilcciura!  coicaants 
liavc  ..licadv  been  estabiislied  by  cardid  analysis  and  application  (vi  other  (non- Im.iu.stK  ) 
task;:  ( John,  1 '/SH;  Nev.eli,  lbp(;,  Wicsnicyet,  ipuO), 

f  h  ovi'  io  j,  rlie  aiiaiyses  !i’ t;  /  .  2  a!:>o  raise  a  wai  ninit  ffag:  Nl .  Sisu  aiipear  s  to  fa-  opei  ating 
:ii  very  r  k'se  to  die  hums  u)  .lie  real  tune  constraint.,  and  m;iY  m  iact  be  too  sl>  e.r ,  .Aridii louai 
iiiotleluii'  and  enipc  usak  u  ili  be  iiccessiity  to  settle  die  issui'  conchisix cb  . 
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1 .  Comprehension  is  incrementa:  and  immediate,  at  all  levels 
(syntactic,  semantic,  referential) 

2.  Comprehension  is  mix  of  recognition  (automatic)  and  deliberation; 
referential  processing  more  likely  to  require  deliberation 

3.  Rate  of  skilled  comprehension  is  ~230  words  per  minute 

4.  More  time  spent  on  content  words  than  function  words 

5.  More  time  spent  on  complex  syntax  than  simple  (tens  of  milliseconds) 

6.  More  time  spent  on  disambiguating  region  if  wrong  interpretation  chosen 
(tens  of  milliseconds) 

7.  More  time  spent  on  filled-gaps  (tens  of  milliseconds) 

8.  More  time  spent  on  ambiguous  regions  than  unambiguous  regions 
(tens  to  hundreds  of  milliseconds) 

9.  More  tune  on  novel  aspects  than  familiar 


Figure  7.4:  Suinmary  of  NI .-Soar’s  predictions  concerning  immediacy  acd  the  i.ime  course  of 
comprehension. 

This  chapter  effectively  completes  the  an.swer  io  the  question  raised  at  the  beginning 
of  Chapter  1:  how  do  pieople  manage  to  comprehend  so  fast?  h  also  completes  our 
tour  through  the  nia-jor  nhenomena  of  sentence  comprehension.  Before  drawing  general 
conclusions  in  tfie  linal  chapter,  the  next  chapter  h:‘!e(1y  explains  how  NL-Soar  makes 
interesting  predictions  for  languages  other  than  English. 
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Chapter  8 

Cross-linguistic  Phenomena 


Cross-linguistic  data  ma}/  be  brought  to  bear  by  assuming  that  the  underlying 
architecture  of  comprehension  is  the  same  across  languages.  This  is  certainly  the 
most  natural  assumption  to  adopt  for  NL-Soar,  since  both  the  Soar  architecture  and 
the  additional  content  posited  by  NL-Soar  make  no  language-specific  commitments.  In 
this  chapter,  we  will  examine  NL-Soar’s  predictions'  for  a  variety  of  languages,  including 
many  with  structure  that  differs  significantly  from  English.  The  first  section  considers 
parsing  breakdown  effects  in  head-final  languages.  The  second  section  examines  a  number 
of  cross-linguistic  garden  path  efl^ects  and  unproblematic  ambiguities — some  replicating 
effects  found  in  English,  and  some  involving  structures  with  no  counterpart  in  Engli.sh.  The 
chapter  concludes  with  a  brief  summary. 


8.1  Parsing  breakdown  on  verb-final  structures 

Slacking  three  NPs  in  .'lentcnce-initial  position  lead,',  to  breakdown  in  the  cla.ssic  English 
center  embedded  coiistriKiion: 

1 2  1  d,  I’}!  1 )  ffie  man  the  cat  the  dog  chased  li*  es  cried 

t  hajiter  .2  showed  iio'v  N1 .  .Soar  accounts  t<  i  tliis  witli  the  two- valued  limitation  on  syiilactie 
indices  m  the  A/R  set.  However,  stacki.ag  NPs  is  much  less  problemaiic  in  head  final 
laiiguagi’S.  is  denioiisti.iie(.l  I'V  the  lollov.-ing  aece[)table  2  Nl'  initial  .lapanese  sentence 
( ( iibson,  I  t'd  !  ); 

i  220,  .M  I  loiin  u  a  l  ied  ga  Hill  o  suki  ila  to  oitiotleiiu 

.h)ti:i  1  retl  NOM  Hill  .Ar  t’  likes  r'OMl'  thinks 
t  totin  thinks  ihat  l  ied  likes  !Hll.l 

i  In-  N1  Sw.ii  -aria  ikir-,  ii.il  si’i  pnscss  liir  iiass  liiiriii-ai,  rs.uiii'irs  iii  llii-.  rli.ijari  ;  ihc  jai'ilii  iiiHs. 
air  ^iral:'illli'^v.  .iiailv  hv  tiaiitl 
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IP 


NP  VP 
Fred  ga  | 
V’ 


NP  V 
Bill  o  suki  da 


Figure  8.1:  Phrase  structure  for  Japanese  sentence  John  thinh:  that  Fred  likes  Bill,  showing 
head-final  syntax. 


However,  there  is  a  crucial  structural  difference  between  (219)  and  (220):  all  three  NPs  in 
(219)  occupy  spec-IP  (subject)  position,  while  at  most  two  NPs  occupy  spec-IP  in  (220)^. 

fl/7/1  occupies  complement  of  V’,  as  shown  in  Figure  8.1.  Thus,  no  single  structural 
relation  must  buffer  more  than  two  NPs  in  the  A/R  set: 


Rfceiveks 


spec-IP: 

comp-V’: 


l,v/>  John],  Fred] 
l/vp  Bill] 


Fvcii  4-NP-initial  sentences  may  be  acceptable: 


(221;  .\E)  John  wa  Fred  ga  biiruu  o  Dave  ni  agela  koto  o  kiita. 

.lolm  TOP  Fretl  NO.M  beer  ACC  Dave  l),\l  gave  C'GMP  Afc  heard. 
(John  heard  that  Fred  gave  beer  to  Dave.) 


ill  (22  1  ),  I  V/'  hccr]  and  |  v/  Ihivt’]  occupy  first  and  secoinl  complement  positions  of  [  grO’c]: 

j  spec-IP:  I/V7*  l<>hn\,  |  ve  l  ied] 

R[.(  rivi  i;s  comp  V^':  I  ve  ht  t  r] 

'  coinpi  V  .  Iv/’/tincI 

A  smiilai  sliLictiui' m  <  icmiaii  is  ac.t'cptable  tor  tiie  same  reasons  ((  iiiison,  I'lOi  ): 

i  he  M!  iiciui  .\i  [i«  iMtion  i  s|)Cc  I !'  \  ■,  '  aclimjct'nfi )  ■  >1  ll  c  iriiti.il  to[iic  in.iikni  NI '  in  jap.uirsc  sir  iii  i  im-s  is  .1 
iiialk  1  o(  Ifisile  .Ni iiir  n!  ihr  | iu'iIk  I !i ins  j'icscMt'‘d  in  lli.s  ctKi|Mci  li.-pt-nil  on  ilns  ijist tin.  I mn.  ifn  nii'h  ir.ii  l\ 
'.‘'..inij'lcs  s  an  he  iIon  imsI  tc'r  k  .hu  li  lln' disnnchon  is  i  i  nt  nil 
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(222;  AE)  Ich  glaube,  daC  John  Mary  das  Geschenk.  gegeben  hat. 

I  believe  that  John  Mary  the  present  given  has. 

(I  believe  that  John  has  given  Mary  the  present.) 

The  verb-final  subordinate  clause  in  (222)  stacks  three  NPs  without  causing  difficulty, 
since  the  NPs  are  distributed  across  multiple  structural  relations  (subject  and  complement 
positions). 

The  German  counterpart  to  (219)  does  in  fact  cause  breakdown  (Gibson,  1991): 

(223;  PB)  Der  Mann  den  die  Frau  die  der  Hund  biB  sah  schwam. 

(The  man  that  the  woman  that  the  dog  bit  saw  swam.) 

NL-Soar  accounts  for  this  in  precisely  the  same  manner  as  the  English  version;  three  NPs 
must  be  indexed  simultaneously  by  the  spec-IP  relation. 


RECE'VERS 


spec-IP. 


Mann],  [yvp  Frau],  [i^f>  Hund] 


r224;  AE)  John  wa  Bill  ni  Mary  ga  Sue  ni  Bob  o  syookai  sita  to  it-ta. 

John  TOP  Bill  DAT  Mary  NOM  Sue  DAT  Bob  ACC  introduced  COMP  say  PERF. 
(Jotin  said  to  Bill  that  Mary  introduced  Bob  to  Sue.) 


Although  such  structures  may  be  perceived  as  somewhat  odd,  they  do  not  cause  the  pars¬ 
ing  breakdown  associated  with  English  center-embedded  relatives.  NL-Soar  can  handle 
structures  such  a;-.  (224),  since  no  single  structural  relation  must  buffer  more  than  two  NPs: 


spec-IP 

]\if  John],  l,v/'  Mnry) 

K1:CI:ivi;r,s 

coinp-V'’ 

l;V/-  Fob] 

e<)nip2  V’ 

Ive  Fill],  f\7>  ,S'i<el 

I'hc  overt  ease  marking  exhibiicd  in  (220),  (221 ),  and  (224)  does  not  in  and  of  itself 
explain  the  contrast  heSvveen  Japanese  and  Hiiglisti  stacked-NPs.  Even  it  the  case  .narkeis 
nnamlngiionsly  uienliiv  the  tina)  sinieUiiai  [lositioii  ol  the  NPs,  there  nuist  be  some  wav 
to  bnitei  the  stitictmes  until  tlu  verbs  appear  I'Uitliermoic,  itie  strucUira!  position  ol  the 
Nl’s  III  unambigiious  (•uglrsh  examples  such  as  |97)  is  known  mimediatelv  without  ease 
maiking,  vet  tins  a()|)arentlN  does  not  lirfii  die  tuiiiian  lancu.ige  piorcssvn.  in  ans  event, 
I  below  eonsiders  examples  ftiai  dcnuttisti  .ite  that  ease  markers  m  Jap.in.ese  do  nut 
aiwass  nii.imbis.’Uous! V  tnaik  stuiciuiai  positions. 

N'eit  all  .N'l’  slat  king  is  aci  cptable  m  .ia(>ane,se  1  he  tolinw ang  S  NP  hu1i,;1  '..enli.  .u  e 
dts.'s  cause  faeakdown  (I  iiiv'on.  I'tOl  )■ 

I  din  ;  I  die r n I  n  >  r > j  ;ivi  1  ‘i  lu  fic n  diul  h  'lin  Vv’lniiiuni  t( ",  hi.rli m\:  (I n s  c x.imi} '!c 
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(225;  PB)  Jon  wa  Mary  ga  Fred  ga  Sam  ga  Bill  o  sukida  to  omotteiru  to  sinziteiru  to 
omotteiru. 


In  (225),  at  least  three  NPs  must  be  buffered  on  the  spec-IP  relation: 

RECFitVEFt5~j~  Spec-IP:  Lv,°  Mary],  [/vp  Fred],  [yp  Sam] 

In  fact,  it  is  not  necessary  to  stack  five  NPs  to  cause  difficulty  in  Japanese.  Mazuka. 
et  al.  (1989)  present  several  examples  of  center-embedded  structures  that  lead  to  parsing 
breakdown  with  just  three  initial  NPs,  as  in  the  German  (223)  and  English  (219): 

(226;  PB)  Akira  ga  Tosiko  ga  Hazime  ga  nakidasita  toki  okidasita  no  ni  kizuita. 

Akira  NOM  Tosiko  NOM  Hazime  NOM  started  crying  when  got-up  that  noticed 
(Akira  noticed  that  Toshiko  got  up  when  Hajnne  started  crying). 

In  (226),,  each  of  the  NPs  occupies  subject  position  as  in  the  English  and  German  counter¬ 
parts,  requiring  the  A/R  set  to  index  three  NPs  on  the  spec-IP  relation: 


Receivers 


Akira],  j^p  Tosiko],  [/vp  Hazime] 


8.2 


L\S 


In  this  section  we  consider  a  niimbei  of  garden  path  effects  and  un[>rol)lemalic  ambiguities 
in  Japanese,  Mandarin  Cliinese,  Hebrew,  Korean,  and  Gcrinan. 


8.2.1  Japanese 

iapancse  case  do  nol  abvays  unamhigiiousiv  uicuidv  sliiu  tuiai  jiosiiion  Pr  ucheii 

(  !  on  j  )  prc'.enls  sc\'er;.ii  exanq.les  m  wlncli  the  sii  ucliii  al  posilion  ol  idenlicall  v  case  marked 
.NP-.  c'dli'i  s  depending  iMi  ihc  Imai  \'crh': 

J7;  lU'  (a)  iolin  ga  Koilrilo  ga  smela 
.ioliii  NOM  Imci  NOM  (bed 
(  l< I'lii  s  lo\ ci  died ) 

(to  iotui  iia  Kes  ga  Miki  do.n 
Jolin  sa  iM  R,_  \  NOM  |(iik!  is 
(  John  hki  ■.  ki'\  ) 

n  I  i I  >  ['■  '1  ill'.  '  nil  ih.il  !■  'i  iniit  ]iciiiit-iil  !’!  .iinni.m,  .il  i  oi'-  ni-,,  ilii"  r  c  ',.nii| 'Ir ■,  aiii  -.l  i  t  ri i iS h  i ihi  .  i 
ii  I  i  in.'i'lr  Mill  'll!-.  .ioi'H  lii'i  I  (tu-  atain Hill i \ 
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(228;  UPA)  (a)  Rex  wa  John  ga  suki  da 

Rex  TOP  John  NOM  fond  of  is 
(Rex  likes  John) 

(b)  Rex  wa  John  ga  mita 
Rex  TOP  John  NOM  saw 
(John  saw  Rex) 

(229;  UPa)  (a)  Rex  ni  John  ga  hanasita 
Rex  DAT  John  NOM  spoke 
(John  spoke  to  Rex) 

(b)  John  ni  nihongo  ga  wakaru 

John  DAT  Japanese  NOM  understand 
(John  understands  Japanese) 

The  relevant  psycholinguistic  fact  about  these  constructions  is  that  they  are  unproblem¬ 
atic  ambiguities — no  matter  vrhich  interpretation  is  required,  no  garden  path  effect  ;irises. 
Because  NL-Soar  is  head-driven,  the  NPs  are  not  attached  until  the  verb  appears,  so  the 
ambiguity  never  actually  arises.  Thus,  the  critical  question  for  NL-Soar  is  not  whether  the 
structure  can  be  repaired,  but  whether  the  A/R  set  is  capable  of  buffering  the  NPs  such  that 
both  interpretations  are  possible. 

Consider  the  -ga  -ga  ambiguity  ui  (227).  Structure  (227a)  is  a  double-subject  constfuc- 
lion  in  which  both  -ga  marked  NPs  occupy  spec-IP: 

(230)  [ip  f/v/.  John  ga]  [„>  [w*  koibito gall] 

in  contrast,  (227b)  is  a  cotistruction  in  which  the  second  -ga  marked  NF^  occupies  object 
pi.>sition: 

(231 )  1//.  \yp  .lofui  ga|  \yp  !i-'  l,v/'  Rex  gaij)! 

Neither  con:aruction  causes  liiiticiiliy.  since  the  NPs  may  be  indexed  bv  multiple  iclatious 
in  tile  A/R  set; 


R-t  IIVCKS 


s['ee  IP 

COIM|)  V" 


I  ,\7'  John  |,  I  ,\7'  /\'e  ( I 
Iv/'  8px\ 


When  the  tlisambigu.iiin!,’  \eib  amves,  it  is  (H>ssible  to  tnn!i.l  eidit'i  mtei pieialinn.  A  smnlar 
c\|>lan.i.t:on  hulds  Im  (2.:S)  and  (22‘*),  wlnsli  exhilnl  dilteicpl  kinds  o!  Kica!  snlpecl 
coil  I  pie  I  1C  ill  ambn.’iniies  .See  (  Pnlclicll,  1  '^0  I  )  !m  detai  Is  o!  the  s  v  ntaci  ic  anai  vsis 

M.i/uka  i‘l  al  tIOSO)  piesiU'U  .in  miciesimj’  imnioblemal ,c  lapanc'sc  l  Oll^tl  ULlon  m 
main  \  ci  b/ielahec  tkiiise  ambnnmN 


I  .  c,'.  I  l’\  I  11;  lv)o  in  p.i  kiiilumc  ()  \ snul.s 

old  man  NOM  child  \i  c.illcd 
(  !  he  oki  Ilian  called  ilic  L'hdd  i 
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(b)  Roozin  ga  kodomo  o  yonda  zyosee  to  hanasi  o  sita. 

old  man  NOM  child  ACC  called  woman  with  talk  ACC  did. 

(The  old  man  talked  with  the  woman  who  called  the  child.) 

In  (232a),  the  NP  NP  V  sequence  roozin  ga  kodomo  o  yonda  is  interpreted  as  the  main 
clause  The  old  man  called  the  child.  In  (232b),  the  relative  clause  reading  is  required,  disam¬ 
biguated  by  the  appearance  of  Up  zyosee].  Unlike  the  familiar  English  main  verb/reduced 
relative  ambiguity,  NL-Soar  can  repair  the  structure  in  (232b).  Figure  8.2  shows  how.  The 
main  clause  interpretation  is  pursued  initially,  with  Up  roozin]  in  subject  (spec-IP)  position 
and  kodomo  o]  in  complement  position  of  [i/p  yonda].  Ne.Kt,  [ap  z}'o.see]  arrives  and  the 
CP  adjoins  to  [,v'  zyosee]  as  a  modifying  clause  (unlike  the  English  version,  the  relative 
clause  is  active,  not  passive,  and  therefore  the  clause  remains  tensed).  The  appropriate 
trace.s  are  generated  in  spec-CP  and  spec-IP  position  (in  the  same  manner  as  English  rel¬ 
ative  clauses).  The  spec-IP  trace  creates  a  local  inconsistency  at  the  IP  node,  triggering  a 
snip  of  I, VP  roozin].  [^p  roozin]  is  now  available  to  attach  as  the  subject  of  the  incoming 
[/P  to  hanasi  o  .vira],  and  the  repair  succeeds. 

Pritchett  (1991)  discovered  one  of  the  first  known  Japanese  garden  paths; 

(233;  GP)  Frank  ni  Tom  ga  Guy  o  syookai  suru  to  John  wa  iwaseta. 

Frank  DAT  Tom  NOM  Guy  ACC  introduce  coMP  John  top  said  Cali,sh. 

(John  made  Frank  say  Tom  introduced  Guy.) 

'Fhe  initial  .sequence  Ihiougli  to  is  taken  as  a  complete  coniplementized  clause  (the  internal 
structure  need  not  ta)nt  -rn  us  here;  for  details,  .see  (Pritchett,  1992)): 

(234)  1(7-  [//>  Frank  rii  lorn  ga  Guy  o  syookai  suru  j  to| 

(Tom  in  roduced  tiny  to  I  rank.  ) 

Ne\i,  |v7'./o/;/j|  IS  encounteied  and  left  imaitached,  waiting  lor  the  final  verb.  The  tinal 
verb  iswLscia  is  a  cairs.itive  veib  !er|amiig  lliiee  aigumeiils,  iiu  hiding  an  obligatorv  ni 
m. irked  causee.  Oiilv  !uo  ai gument.s  are  av.iilal>le  (a;.  and  the  initial  GP.  fhe  NT* 

\m-  I  rcink  ni  \  musi  be  reaii.ihsed  .i.a  an  .ii gjimeul  ot  h-  m'(.'.scrn|.  Howewr,  the  leuiiired 
snip  v\  ilhin  the  ( M’  is  not  loc.il  (u  the  VP  |  y/-  ne./.w/i,' |  w  hu  b  i.s  inissmu  the  argiimenp  die 
repai.i  tails.  lesuhing  m  .i  g.iulen  p.ith  idled 


K.2.2  Koroaii 

.  \  ii  anihii.’iiitN  sill  111. II  In  (2  t  d  .tbm  e  .ii  ;ses  iii  Kdic.iii  as  well  1  Pritehetl.  1  00,’ ) 

id’)  Kcliex  e\ke\  ( '.h.irk's  k.i  lol.i  c  ,iss  t,i  ku  [sU  h;e  k,i  ni.ihi.i  ke\  h.i\  ss  i.i 

(Ku  l'-ie  made  Kclie\  s.iv  Gb.nk's  letiiinet.l  i 

\ s  in  i  '  d  d;.'  Ill ! ;  i,,i  u  nee  Ihi > nigli  ■  ■  .  / ss  lo  id  '  ni.i\  be  t.ikeii  .e,  ,i  si n -pie  e l.m se  \\  tien 

i  he  e.ii!  -..it  I  \  e  \  ed'  .ii  i ;  \  e ,  ihr  NT'  |  Kt  i  if  \  |  iiiiisl  be  le.iii.iK  sed  as  ihe  i  .lU's  c  the  n.s  |ii  1 1  ed 

snip  IS  iioi  isaieialed  and  dw  lep.iii  i.ids 
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(1) 

CP 

(2) 

NP 

1 

1 

i 

NP  VP 

CP 

1‘oozip  ga 

NP 

V 

IP 

N 

kodomo  0 

yonda 

/  \  zyosei; 

NP 

roozin  ga 

NP 

V 

kodomo  0  yonda 

(3) 

NP 

(4) 

NP 

N’ 

Tj- 

CP 

NP 

NI> 

o, 

"'e, . 

N 

/ydsee 

NP 

0, 

IP 

y 

N 

's.^  /vosee 

NP 

NP  VP 

NP 

VP 

I, 

roD/inga 

!( 

NP 

V 

NP 

NP 

\' 

kmii  1111(1  o 

yoiui.i 

loo/in 

klldollU 

IM 

’O  yorul.i 

i'HlUK!  S  .2:  Kop.iii 111;;  .in  nnproliiriiuilK' Ja]iaiK'se  iii  iiii  vcih/irkitive  .  lansn  aiiibipiiilN', 
Cl*  IS  aliai  iu'i!  ,ts  .i  iri.iiivc  iiuhIiIr-i  ol  ilk-  iiKoiiiiMp  Nl*.  Uskinip  (i>  (hn  cicalRni  ol  .i  ti.kC 
This  lii;'!'c'is  ilic  snip  (it  ilk'  Miisan.ilvsi’d  N!’,  inakiiisi  il  asailalilc  h'  si  nn  as  sn|iji\'[  nl 
i-iansf 


S.2,3  Mandarin  C  hiiitsc 

Kfcall  iIk.-  ui:|iii)bl(;!naln  Mili|fi.  l/(>ii|(.'i  i  aiiitnpiiils’  in  !  iipiish 

I  C),  I  M’  \  M  ( .1 )  I  ti'i  i'dl  ii'liii 

il'i  1  ii'ii'ol  lii|niv\k'ni  l< I  1  *1 1'h '(.’It 'll 

1  I K.'  s.iii  K'  k  ilhi  cl  iiii|'i  cl'li.'iii.ii  il  ai  iil'is'lili  \  .u  .M'S  in  \  la  ill  1,11  111  .a.  \n  a  1 1  i  !  ’i  ili  f  lai  I 

i,''  1  r'\)  i.i)  W  c  u  .il  n;  li'  /'li.nc’s.iii 

I  till I'i'i  I’l  I’ I  / 1  i.iiii's.n  1 

I I  Ic;  '!  /Ii.iin's.ni . ) 


S  he  initial 
■Il  sp(,k  n* 

'  ilk'  main 
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(b)  Wo  wang  !e  Zhangsan  yao  qu. 

I  forget  PERF  Zhangsan  will  go. 

(I  forgot  Zhangsan  would  go.) 

NL-Soar  repairs  the  Mandarin  structure  just  as  it  does  the  English  counterpart — the  incoming 
clause  attaches  as  the  complement  of  [v  forgot],  triggering  the  local  snip  of  [/v/>  Zhangsan]. 
In  the  same  way,  NL-vSoar  correctly  predicts  the  following  subjecEobJect  garden  path 
(Gorrell,  1991): 

(238;  GP)  Zhangsan  yi  du  shu  jiu  diao  le. 

Zhangsan  as-soon-as  read  book  then  fall  PERF. 

(As  soon  as  Zhangsan  read  the  book  fell.) 

hvp  shu]  attaches  as  the  complement  of  [y  du],  and  the  initial  clause  adjoins  to  the  incoming 
[jpfeU].  However,  the  snip  of  [/v7>  s-Z/w]  is  not  generated,  as  in  the  English  case  ((149),  1 ). 


8.2.4  Hebrew 


The  sLih)ect/object  garden  path  arises  in  Hebrew  as  well  (Pritchett,  1992): 


(239;  GP)  Axrey  shc-shatiti  maim  hitgalu  be-b’er. 

After  COMP  drank-ls  water  were  found  in  the  well. 

(After  1  drank  w.iter  was  loand  in  the  wci!.) 

I  he  e.xpianation  is  the  same  as  ui  tlic  Mandarin  and  English  e.xamplcs:  the  local  snip  is  not 
generated  to  remove  j  v/^  nuiint]  from  complement  position. 


8.2.5  (iertniEU 

( 'KH'ki'i  (  I  '191))  [iresents  .in  esanifve  ot  .i  r,.ii(len  path  m  ( iemi:in  no'oK  ing  an  objis  t./olriei.  I 
ambiguity: 

(..’•It);  tiP)  d.ib  dei  I'jitdeeker  von  .Amenk.i  erst  im  IS  .lain liiiii'.iei t  erlahreii  hat 
th.it  the  iliseox ei'ei  ol  .Ameiie,'  llI.^t  m  IStli  eentius'  ieameil  ol  h.is 
ilh.it  the  diseoveiei  onginalh  ie.smed  oi  AmeiR.i  m  the  i  .Stli  eeiiturv  i 

hhe  IT  !/■/  ii7.'  Anu’nka]  is  .■uti.ili\  t.'kcm  .is  llie  eoinpleuH  nl  ol  E  ijmif,  kt  r\  to  torm  the 
\!T//(  M'-'O'/  Ilf  knii'ru'ii 

(.’111  I  del  I  ■  I  utdei  kei  |  /  v-  Mill  Aiiiei  iha  ] )  I 

W  hell  I  I'fiiini'n  hiii\  aiioe-,  ir'i  I S  /ahr  iiiinji  i  f]  rs  .nljoiiied  .i,s  .1  iiienlitie  i  ifainti! 

.ei|iiii.'.  iwii  .loMuneiii'.,  hii!  iiiiK  '.ine  is  .0  .iihihle  \  tin  !  nUiii  kt  1  wn  Air.i-t  tLa\  I  tie 
I  ’i '  I  1’  \  i'-!  Ant,-J  ik,t  \  mils!  tv  I  eamih  sed  .is  oi)|eei  i.'l  ■  hih/rn.  )  low  e  .  ei ,  t!u.'  ;  epii  n  ed  snip 


H  3.  Summiify 
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TabLK  8.1 :  Surnniaiy  of  NL-Soar's  cross-Hngui.stic  coverage. 


■Ab 

Acceptable  3-NP  subordinate  clause 

(222) 

German 

PB 

Difficult  3-NP  center-embedded 

(223) 

GP 

Object/cbject  garden  path 

(240) 

Hebrew 

GP 

SubjecUobject  garden  path 

(239) 

AE 

Acceptable  3-Nr-ir.itial 

(220) 

AE 

Acceptable  4-NP-initial 

(221) 

AE 

Acceptable  S-NT  initial 

(224) 

PB 

Difficult  5-brP-initiai 

(22.5) 

PB 

Difficult  .3-NP-initia.l 

(226) 

Japanese 

UPA 

Unproblematic  -tja  -ga  ambiguity 

(227) 

UFA 

Unproblematic  -h-j  -ga  ambiguity 

(228) 

UPA 

Unprcbiernatic  lu  -ga  ambiguity 

(229) 

UPA 

Unproblemadc  main  verb/relative  ambiguity 

(232) 

GP 

Dative/C  A  USEE  garden  pam 

(233) 

Korean 

GP 

Dative/C.AUSEE  garden  path 

(235) 

UPA 

Unproblematic  subjccUobject  ambiguity 

(237) 

GP 

Subject/object  garden  path 

(238) 

is  not  generated,  because  tlie  conipiement  relation  is  nut  io< -,i  to  the  VP  with  ti:£  missing 
argunient.  As  a  result,  the  repair  faiis\ 


8.3  SuHifiiary 

Table  8,1  sunirnanzcs  the  examples  analysed  in  ih;,>  chapter.  Although  the  range  of  con¬ 
structions  is  small  compared  to  the  substantia!  F^ngii.ah  colleciion  addressed  in  C^haiftcis  5 
and  6,  ihe  varictv  is  gieai  enough  io  establish  NL.  Soar  as  a  viable  candidate  ‘or  a  iini’''e'.'';al 
comprehension  theory. 

Nl.-Soai’s  predictions  about  NP-stacking  in  head-final  |■••^guages  nuiy  lx  eounter 
intuito  i‘  to  the  native  Enelish  speaker,  but  the  eonirasts  aincng  dt<  slackv  ci  Nf^eoastruePons 
tparticula  \  (220)  (224),  (22.S),  and  (226))  provide  .xidiiK'nal  support  lor  tiie  strunure  v4 
the  .A/P  st’i.  These  head  final  structures aie  nnportant  because  ihey  piernnl  lesling  ihe  theory 
in  wavs  b:.:!  arc  simply  no*  possible  vv.uh  head-  initial  i-anguagcs 

Nl.Oaiav  aPo  estahli.shcs  that  hcad-dnven,  bott<.tn  up  parsing  need  not  predicl  undue 
difficultv  in  head  tin  il  languages,  as  e-.  sometimes  sigrpv.-'S-ed  (I'la.oei,  !OH7),  Prediciion.^ 
o!  Jilt'cnltN  must  in.'  madr*  wi'h  respeci  to  preeiselv  ariieulatcd  ", sump. ions  .aaout  the 
utidcrK  mg  nus  liani;-  n.:,  supp./rtnig  (.aniipiehcn -non  1  he  as'-erl.'on  ifi;.: '  'lul'Ii  r  iiig  <,dd,(!!(.mal 

A,  Minil.u  .III il'is. ii!l \  ■  i'i '.  1 -.I'.'mj.'  ,in  t.iliiei. l/,i(t)i!ni.  l  .iuiIiisiliiIv  ■'  im  kis,  I'Cl"/)  i.itii  rr  .:  c  i,U'.r  ,i 
[i.iih  '.-‘Iri  i  i'r iU- hf' I  M''U,  .  t- >, jil.ii n-.,  till',  t'N'  -.issf  ling  tiial  at'iamas  -iO,  U;  I'.  „ui  ii  i;c'J  m 

I  i  ,Si  I  lit  "  r  r  j  .1,1  '.in;.:,  tint  ,,  ">  ni  >1  ak  ;n'  !k)\>.  mi-;  i  a.  -■  ir  .ilf-;’ v  sin  lUiii  re.i!  i. !  in  a-  '  .S,  ai 


2!4 


Chapter  8.  Cross-h/iguistic  Phenomena 


NPs  always  increases  difficulty  carries  with  it  implicit  assumptions  about  the  structure  of 
short-term  linguistic  memory.  NL-Soar  clearly  demonstrates  alternative  mechanisms  are 
possible  which  dc;  not  necessarily  predict  overload  wids  stacked-NPs. 

Trie  predictions  for  the  garden  path  and  unproblematic  ambiguities  provide  additional 
support  for  NL-Soar’s  repair  rnsclianism.  It  is  important  to  establish  that  .str!.icture8  identical 
to  GP  structures  in  English  also  yield  garden  path  effects  in  other  languages.  Just  as 
importantly,  the  success  of  the  theory  on  structures  that  have  no  counterpart  in  English 
increases  confidence  in  !i«Tiited  repair  as  a  universal  mechanism.  We  also  saw  another  clear 
example  of  the  structure-.sensitivity  of  the  theory;  the  main-verb/reiative  clau.se  ambiguity 
in  Japanese.  On  the  surface,  this  ambiguity  is  quite  similar  to  the  English  garden  path,  yet 
NL-Soar  correctly  predicts  that  the  srructure  can  be  unproblei'natically  repaired. 

This  chapter  completes  the  detailed  application  of  NL-Soar  to  psycholinguisfic  data.  In 
the  next  and  final  chapter,  we  step  back  and  'valuate  NL-Soar  as  a  comprehensive  mode! 
of  sentence  processing,  and  place  the  corn  /d  of  some  closely  related  theories. 


Chapter  9 


Genera!  Discussion  and  Conclusion 


There  are  more  things  in  an  architecture,  Horatio, 
than  are  dreamt  of  in  your  theorizing. 

—  Allen  Newell 


Now  THAT  THE  PHENOMENA,  theory,  and  predictions  have  ^^een  desciibed  in  deptl:, 
we  can  step  back  and  evaluate  NL-Soar  as  an  integrated  psycholinguistic  model, 
and  situate  it  within  the  context  of  other  sentence  processing  theories.  This  chapter 
first  presents  a  summary  of  the  model  and  its  predictions,  followed  by  a  discussion  of  the 
theoretical  and  empirical  consequences  of  embedding  NL-Soar  within  the  Soar  architecture. 
The  next  section  explores  several  theoretical  issues  (e.g.,,  individual  differences)  that  did 
not  receive  adequate  attention  in  the  previous  chapters.  Next,  some  closely  related  theories 
are  discussed  and  compared  with  NL-SvOar.  The  thesis  closes  with  a  look  at  challenging 
areas  for  future  work,  and  a  brief  conclusion. 


9.1  Sl  Jimary:  the  model  and  its  predictions 


N]  ,-Soar  is  fiisr  and  foreniosi  a  fi  nclional  model  that  posits  coiiiputationa!  meclianisms  foe 
rcaliziag  tlio  tusk  of  compiehension.  The  model  is  based  on  an  independently  developed 
theory  of  the  t ogintivc  architecture,  which  specifie.s  the  basic  control  structure,  nicmory 
s!i uc'crcs,  processing  priniiti'^es.  and  learning  mechanism,  Table  9.1  summarizes  the 
fundauumla!  juincipies  of  Nl.-Soar,  all  of  which  are  described  in  detail  in  ChapJer  3. 

Itibic  9.2  summarize  the  predictioirs  of  NL-Soar,  as  described  in  Cnapters  4-8.  AU  of 
dicse  prciJiclions  derive  from  interactions  of  the  basic  pnnr.if.)lcs  of  iiic  moded,  .uni'  basic 
principles  of  Sour,  Many  of  the  predictiorA  are  novel  {these  are  marked  n'l  the  tribic),,  in  tfiat 
Nf.-Som  is  ifit:  firs!  theoiy  to  clearly  make  tiie  prediction 


In  addition  to  these  geneivil  pred'ctions,  Nf  , -Soar  provides  a  detailed  acetfUjit  of  a  wide' 
i.iiija '  of  ym  (.!cn  jiatl)  effects,  uoproblcinatic  atnbignttscs,  pavo.ng  lu'eakdown,  and  luxxyvi. able 
CinbcddMiy ItH'  ino<itd  has  been  applied  to  a  eofiection  of  i!8  diiferenr  conssrcci'oris 
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Table  9.1 :  The  ba,sic  characteristics  of  NL-Soar. 

1 .  Comprehension  operators  (incremental  u-constnictors,  s-constnictors,  resolve  operators) 
Comprehension  as  a  continually  improving  mix  of  deliberate,  recognitional  behavior 

3.  Models  representation  of  syntax,  meaning,  and  reference 

4.  Limited  syntactic  index  for  utterance  model 
5  Context-independent,  paraliel  lexica!  access 

6.  Head  driven,  constraint-bared  construction  of  utterance  model 

7.  Simple  destructive  repair  mechmiism 

8.  Reference  resolution  as  recognition  of  semantic  descriptions:; 
reconstructive  memory  of  discourse 

9.  Open,  mixed  paraliel/serial  control  structure 


repre,senting  lhe,se  phenomena  (including  17  cross- linguistic  examples),  with  a  success  rate 
of  about  92%  (108  correct  predictions).  The  results  are  summarized  in  Table  9.3. 

Figure  9. !  provides  a  qualitative  comparison  of  NL-  Soar  to  some  other  related  theories, 
evaluating  each  model  with  respect  to  the,se  part.’cular  sentence- level  phenomena.  Although 
this  comparison  does  not  take  into  account  all  of  the  considerations  irnpoitant  to  the  other 
theories  (e.g.,  individual  differences),  it  shcyuld  be  clear  from  Chapter  2  that  th  'se  phenomena 
form  an  important  core  to  be  addressed  by  any  model  of  sentence  comprehension. 


9.2  The  role  of  the  architecture 

NL  Soar  is  deeply  shaped  by  the  Soar  architecture.  By  novv  it  should  be  clear  that  Soar 
i.s  more  than  just  an  iniplemeniation  language  for  NL-Soar.  All  of  the  fundamental  prin- 
cip!,,;s  of  Soar  have  theoretical  and  ultimately  e.mpirical  consequences  for  the  model.  A 
few  examples  w'ili  lieip  further  clarify  the  point.  Consider  Nl.-Soar’,s  control  structure—  it 
IS  the  coKtrol  structure  of  Soar.  This  loads  directly  to  ihc  'speii  nature  of  ambiguity  res- 
oluiivia,  as  well  as  the  flexibility  for  error  recovery  (§4.2).  Soar's  recognition  memory 
and  control  strnctuie  together  lead  to  several  of  the  interesting  limitations  of  ambiguity 
ivs'iiiition  (§4.1),  and  the  distinction  between  automatic  and  deliberate  {irocesses  (§7.1). 
The  (.onlinuous  learning  mechanism  of  Soar  le-ids  to  the  prediction  that  variou-,  aspects 
of  fotnprefieM.sion  can  be  niodniateii  by  experience  (§4. 2, §74  ,i,  h,  also  provides  the  bas.c 
mecfumisrri  Lir  assembling  the  n'(|Uired  lecognitionai  cornpichension  o[)eraU)rs  (§3.3.4). 
The  temporal  manping  of  Soar  'S  wliat  pcimils  the  zero-oararncier  cliionornetric  prcdic 
of  •  ('‘!ii[)reheit'Uon  rales  and  the  sclative  time  coui:Ne  of  tonqiichensio!!  (§7  2. §7. 3). 
The  nalur-  o!  ciimikinp  leads  to  a  reconstiuctivi*  nienio,ry  of  cn,mprehejKlc(i  contenl.  The 
eomeni  to?  eiiimciiey  m  piuhleiri  sfiace  search  and  knowfedge  search  •  mcogriitum  matcli) 
niuiivaics  foe  liniiii.‘!i  repair  Miecharuvm  (§3.3.3!. 


218 


Chapter  9.  General  Discussion  and  Conclusion 


Table  9.2:  .Summary  of  NL-Soar’s  predictions. 


AR 


PB/AE 


GP/IJPA 


Inimn'C 


1 .  On-line  ambiguity  resolution  potentially  open  to  any  knowledge  source 

2.  Ability  to  detect  ambiguity  limited  by  syntactic  working  memory 

3.  Recency  effect  beyond  2  attach,  sites  for  same  relation,  ceteris  paribus 

4.  1  Inder  press  of  time,  relevant  knowledge  may  not  be  available  to  resolve* 

5.  Linguistic  Einstellung  may  occur  (masking  of  deeper  knowledge)* 

6.  Certain  ambiguities  (e.g.,  subject/object)  not  immediately  detected 

7.  Ambiguity  resolution  behavior  modulated  by  learning* 

8.  Parsing  breakdown  independent  of  ambiguity 

9.  Insufficiency  of  embedding  depth  alone  to  cause  parsing  breakdown 

10.  Sharp  drop  in  acceptability  at  two  center-embeddings 

1 1 .  F^arsing  breakdown  independent  of  length 

12.  Parsing  breakdown  independent  of  short-temi  item  memory' 

13.  Limited  effect  of  explicit  instruction  and  training  on  parsing  breakdown 

14.  Potential  for  effect  of  semantic  content  (assuming  item  memory) 

15.  Stacking  NPs  sometimes  acceptable  in  head-final  languages 

16.  Garden  path  effects  a  fonction  of  context,  experience,  and  stnicture* 

17.  Gaidcn  path  effects  recovered  from  by  careful  reprocessing* 

18.  Gard  .m  path  effects  bidirectional 

19.  Gar  ien  path  effects  largely  independent  of  length 

20.  Structurally-modulated  distance-to-disambiguation  effects  can  arise* 

21 .  Lc.xical  ambiguity  neither  necessary  nor  sufficient  for  garden  path 

22.  -Semantic  ambiguity  not  sufficient  for  garden  path 

23.  Comprehension  incremental  and  immediate  at  all  levels 

24.  Comprehension  is  mix  of  recognition  (automatic)  and  deliberation; 
referential  processing  more  likely  to  require  deliberation* 

25.  Rate  of  skilled  comprehension  is  ~230  words  per  minute* 

26.  More  time  on  coiPent  words  than  function  words 

27.  More  time  on  complex  syntax  than  simple  (lens  id  ms*) 

28.  More  time  on  disambig.  region  when  wrong  path  chosen  (tens  of  ms*  ) 

29.  More  time  on  lilled-gaps  (tens  of  rns*) 

30.  More  time  on  ambig.  regions  than  unambig.  (tens  to  hundreds  of  ms*  ) 

3 1 .  More  time  on  novel  aspects  than  familiar 


Ak  sdiK  UJral  aint,>!euity  lesnliitioii,  t’H  -  parsiet;  breakdown,  Abi  acceptabie  (rmbeddinj's 
( il’  ;  garden  path  i  fleets,  i  fl’.A  iinpiobleniatR  ambiguities 
Iriim  -  iminrdiai  V  of  micrpieianoi;,  IC  time  i  oursc  ot  i  (iriipri'hensnin 
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Table  9.3:  Summary  of  NL-Soar’s  predictions  on  the  GP/UPA/PB/AE  collections  (includ¬ 
ing  the  17  cross-linguistic  examples). 


Collection  type 

Number  of  items 

Correct  PREoicnoNS 

%  CORRECT 

Unprobiemtic  ambiguities 

36 

33 

92 

Gaiden  paths 

31 

29 

94 

Parsing  breakdown 

20 

17 

85 

Acceptable  embeddings 

31 

29 

94 

Total 

118 

108 

92 

The  richness  of  architecturally-based  computational  theories  is  also  reflected  in  the 
variety  of  ways  that  such  theories  may  be  used  (Newell,  1990).  The  theoretical  derivations 
in  Chapters  4-8  exhibit  this  variety.  A  number  of  important  qualitative  predictions  were 
derived  from  the  basic  structure  of  NL-Soar  and  Soar.  Many  detailed  predictions  v/ere 
verified  by  system  runs  (the  cross-’inguistic  predictions  were  made  by  hand-simulation). 
Approximate  temporal  predictions  were  generated  in  several  ways:  directly  examining  the 
structure  of  the  system,  making  estimates  of  model  behavior,  and  using  traces  of  actual 
system  behavior. 


9.3  Some  theoretical  issues 


Several  important  theoretical  issues,  such  as  modularity  (§4.4)  and  parallelism  (§3.6.3) 
have  already  been  dealt  with.  This  section  examines  a  few  other  relevant  topics. 


9.3.1  The  A/R  set  and  the  magic  number  two 

riie  structure  of  the  A/R  set  and  the  limit  of  two  nodes  per  strueturai  index  plays  a  key  role 
in  the  predictions  on  p»!rsmg  breakdowm  (Cliapter  5).  I'he  motivation  for  this  structure  goes 
beyond  its  ability  to  correctly  predict  the  difflcully  of  center-embeddings.  The  following 
summarizes  the  functiotial,  psycholvigicai,  and  computational  foundations  fortlie  A/R  ^et. 


I'imciiinuil  moftvalum  lor  A/R  .u’t 

!  he  i)asu  strucUm’  o1  tlic  A/R  set  is  designed  to  cflectively  com|)iei.iend  seiitciH:cs  (  §  1.3  3). 
Ii  indexes  the  [la'Iial  syntactic  stiuctuie  in  a  way  tliat  makes  generating  nev\'  sti  ui.  tuisii 
lelalions  and  nilci|)!etiiig  ex’sting  ones  a  sim|)le  inatcli  piocesi .  fu!  tl;iermoie,  the  partRad.u 
set  ot  disc!  iminatnig  rciations  ate  not  arbitiarv,  but  arc  derived  directly  liom  .X  b;n  syntax. 
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Interference  in  short-term  memory 

As  described  in  §5,1,  the  A/R  set  can  be  characterized  as  a  kind  of  syntactic  interference 
theory.  Although  content-dependent  interference  has  not  been  imporiant  in  theories  of 
linguistic  short-term  memory,  it  has  played  an  important  role  in  classic  short-term  memory 
work  emphasizing  phonological  working  memory.  (Interference  is  also  important  in  long¬ 
term  memory  work,  though  that  will  not  concern  us  here).  Three  important  principles  to 
emerge  from  this  work  are  decay,  chunking,  and  interference  (Baddeley,  1990;  Simon  & 
Zhang,  1985).  The  models  of  Baddeley  and  Simon  and  Zhang  both  assume  a  store  that 
holds  phonological  information  which  decays  in  about  two  seconds,  unless  the  mem.ory 
is  refreshed,  through  overt  or  covert  rehearsal  The  short-term  span  is  thus  whatever 
subjects  can  articulate  in  about  two  seconds.  Interference  shows  up  as  the  phonological 
similarity  effect.  The  span  for  phonoiogically  similar  items  is  greatly  reduced  (Baddeley, 
1966;  Simon  &  Zhang,  1985).  For  example,  Baddeley  (1966)  found  that  immediate  serial 
recall  of  five  one-syllable  words  dropped  from  about  80%  accuracy  to  10%  accuracy  when 
phonoiogically  similar  word  were  used  {map,  can,  mat,  cap,  mad,  man)  in  contrast  to 
phonologicaly  dissimilar  words.  Simon  and  Zhang  (1985)  conducted  the  most  extreme 
possible  test  of  phonological  similarity,  using  sequences  of  Chinese  character  homophones. 
For  example,  they  used  a  sequence  of  orthographically  and  semantically  distinct  characters 
ail  pronounced  “gong”.  Span  for  characters  dropped  from  six  to  seven  for  nonhomophonic 
sequences  to  two  to  three  for  homophonic  sequences. 

We  can  only  speculate  at  this  point  about  the  relationship  between  the  classic  theories 
of  short-term  memory  and  the  structure  of  the  A/R  set.  However,  a  consistent  picture 
that  emerges  i.  ane  that  characterizes  human  short-term  memory  in  terms  of  indexing  and 
di.scrimination  (using  phonological  features  in  the  case  of  phonological  memory,  syntactic 
features  in  the  case  of  syntactic  memory),  with  severe  limitations  on  the  ability  to  represent 
indiscrirninable  contents.  There  may  even  be  some  indication  of  commonality  across  the 
two  domains  in  terms  of  what  that  severe  limit  is:  the  magic  number  two  of  the  A/R  set  and 
the  2-.3  span  lor  pure  homophones  in  the  phonological  case.  In  general,  however,  there  are 
no  reasons  to  expect  that  the  constants  associated  with  decay  rate  and  interference  effect 
should  be  the  same  across  domains  (tlic  decay  rates  are  certainly  different  for  visual  and 
auditory  short- term  stores  (Baddeley,  1990)). 


(  'omputational Jo undai ion 

As  mentioned  m  §5  .3, 5,  tiiere  is  giiod  ('omivjiatioiial  reason  to  e  .pect  limits  on  uticliscruii 
inalcd  sets  '!  he  w'ork  on  the  recognition  match  in  .Soar  identifies  undiscriminated  sets  as 
(fie  [)iimaf  v  sonict.'  ol  potentially  exjionential  niatcii  expense  (  rarnbe  ct  al.,  19‘)())  ( ine 

ol  the  kev  meihods  ol  eliminating  tinwanied  eoiifoinatorics  in  the  rnalcfi  is  to  completelv 
elmnnate  mullqily- \ allied  atlribiites  m  vvorkmg  mcriioiy.  ’!  lie  limit  oi  two  m  Ihe  /\/R  set 
;  (Miles  close  lo  tins  uni  atfrifiule  sc  heiin.'. 
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The  magic  number  two  in  sentence  processing 

The  empirical  motivation  for  the  constant  two  is  fairly  broad  just  within  the  area  of  lin¬ 
guistic  processing  addressed  in  this  thesis.  Apart  from  simply  predicting  the  difficulty  on 
center  embedding,  it  captures  a  wide  range  of  contrasts  between  acceptable  and  difficult 
embeddings  (Chapter  5).  We  saw  in  Chapter  4  that  it  also  predicts  a  severe  limitation  on 
the  ability  to  detect  ambiguity.  It  leads  to  structure-modulated  length  effects  in  garden 
path  phenomena  that  otherwise  have  no  explanation  (Chapter  6).  Finally,  it  captures  some 
interesting  acceptability  contrasts  in  NP-stacking  in  head-final  languages  (Chapter  8). 

It  is  not  surprising  that  the  number  two  shows  up  in  other  psycholinguistic  theories 
as  well:  Kimball’s  principle  of  two  sentences  (Kimball,  197.3),  A-over-A  Early  Closure 
(Church,  1980),  Gibson’s  modified  closure  principle  (Gibson,  1991),  and  various  architec¬ 
tural  theories  of  parsing  breakdown  (§2.5.4)  all  involve  two-ness  as  a  key  feature. 

9.3.2  Individual  differences 

Individual  differences  can  potentially  show  up  in  every  aspect  of  NL-Soar  that  is  realized 
by  associations  in  long-term  memory.  For  example,  ambiguity  resolution,  referential 
processing,  and  even  the  ability  to  recognize  complex  syntactic  constructions  can  all  be 
modulated  by  learning  in  NL-Soar,  and  thus  are  potential  loci  of  individual  differences. 
The  fact  that  NL-Soar  predicts  such  differences,  and  provides  the  explicit  mechanisms  that 
explain  the  nature  of  the  differences  and  how  they  emerge,  is  one  of  the  neve!  strengths  of 
the  model. 

Such  differences  may  be  characterized  as  knowledge  based  differences,  in  contrast  to 
architectural  differences.  Any  theory  of  individual  differences  must  provide  some  degree  of 
variability.  For  NL-Soar,  that  variability  is  in  the  content  of  the  recognition  memory,  which 
determines  what  aspects  of  the  comprehension  skill  arc  automatic  (chunked),  and  what 
aspects  are  deliberate.  NL-Soar  does  not  provide  variability  in  the  underlying  architecture 

However,  recognition  memory  has  a  considerable  amount  of  inertia  to  it--(jncc  a 
massive  body  of  associations  has  been  built  up  for  a  particular  skill  (such  as  for  compre¬ 
hension),  th.'it  body  of  as.sociations  takes  on  architectural  characteristics,  in  the  sense  that 
the  architecture  is  what  is  relatively  fixed  about  cognition  (In  fact,  we  saw  in  §4.4  liow 
the  comprehension  skill  of  NL  Soar  exhibits  various  aspects  of  modularity,  a  putatively 
architectural  concept. )  Even  though  the  comprehension  :,ki!l  in  Nl ,  Soar  may  be  modulated 
by  experience,  it  certainly  cannot  be  fuiniamentaliy  changed  cm  any  short  time  scale.  The 
change  must  be  rather  slow,  because,  any  local  processing  can  only  add  a  fixed  number  of 
chunks  duit  is  tiny  compared  to  the  total  amount  oi  associations  devotcil  to  ((mprehen 
sion  What  this  means  witfi  rcspiect  to  individria!  difTeiciiix's  is  that  tlie  distinction  between 
knowledge  Ir.-ised  and  archifcciuialiy  ha,sed  differences  becomes  somewhat  blurred. 

It  IS  nevertheless  still  po.ssible  to  consider  how  Nl.  Soar  ui.glit  tie  dianged  to  mtioduce 
jUrre  slriie'inal  vaiiabilitv.  1  he  obvious  taindidaic  (or  variability  is  the  iiiagie  eoiistant  two 
ut  tlie  .A/R  Sv*t.  Assuming  lliat  tliis  constant  rcllcitssomc  rathei  iiiiulamenta!  techiioiogica! 
iiiiiitalion  (see  liie  (.fiscussioii  above),  it  would  not  lie  unreason.itiie  to  sp.'cuhile  tliat  the. 
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limitation  may  shew  some  variability  across  individuals.  Such  variability  would  be  more 
along  the  lines  of  the  variable  working  memory  capacity  of  CC  READER  (Just  &  Carpenter, 
1992).  Although  developing  such  a  model  is  beyond  the  scope  of  this  thesis,  one  clear 
prediction  would  be  that  groups  of  subjects  of  similar  working  memory  capacity  will  still 
exhibit  the  same  contrasts  on  the  parsing  breakdown  collection  that  the  two-  valued  A/R  set 
produced.  For  example,  object  relatives  will  still  be  more  difficult  than  subject  relatives, 
center-embedding  more  difficult  than  right  -branching,  and  so  on — even  though  performance 
on  identical  embedding  levels  may  differ  across  subjects. 


9,3.3  Grammar  and  parser 

This  thesis  did  not  directly  addre.ss  the  relationship  of  grammar  and  parser,  but  because 
NL  Soar  is  functionally  com.plete  and  does  bring  to  bear  grammatical  know'ledge,  it  is 
possible  to  derive  some  answers  Irom  the  theory. 

NL-Soar  reflects  rather  clearly  a  competence/performance  distinction.  The  grammatical 
knowledge  in  the  system  may  be  given  a  characterization  that  is  independent  of  the  particular 
form  in  whicn  that  knowledge  is  held.  Furthermore,  it  is  easy  to  see  how  NL-Soar  exhibits 
various  performance  effects  that  make  it  fall  shoit  of  perfectly  realizing  the  competence 
grammar.  In  Chapter  2.3  on  ambiguity  resolution,  this  was  called  a  failure  of  the  symbol 
level  to  implement  the  knowledge  level — more  general  terminology  for  talking  about  the 
same  distinction.  The  limitations  of  ambiguity  resolution,  garden  path  effects,  and  so  on  are 
all  examples  of  NL-Soar’s  mechanisms  failing  to  perfectly  bring  to  bear  the  grammatical 
knowledge  which  is  nevertheless  represented  in  its  various  problem  spaces. 

More  can  lie  said  about  the  form  of  grammatical  knowledge  in  NL-Soar;  the  grammar  is 
in  a  compiled  form  v/hen  it  is  broughtto  bear  recognitionaily.  However,  thccoinpiled  aspects 
are  still  independently  repre.sented  in  lower  spaces,  .so  there  is  considerable  redundancy  in 
the  system  at  any  given  point  in  time.  As  pointed  out  in  Chapter  3,  the  view  cd'  grammar 
as  consisting  of  a  small  number  ol  interacting  principles  fits  well  within  this  structur.  The 
richness  of  fhe  interactions  among  the  grammatical  constraints  is  the  engine  that  builds  up 
(via  chunking)  the  large  set  of  etticicnl  parsing  associations  in  recognition  memory. 

Allhough  the  choice  of  governmeni  and  bmdmg  tlK  ury  was  not  parlicularly  rnotivaled 
by  psycholinguistic  data,  it  is  clear  that  the  choice  of  grammar  has  implications  tor  NL 
Soar  s  predictions,  I'lic  ,Nl.-So,ii  model  can  be  partially  abstracted  away  from  the  grammar, 
iiut  the  abstracted  theory  cannot  make  empirical  predictions.  I  he  predietions  depend  on 
tlie  onloloey  of  syntactic  relations  and  the  precise  slructures  assigned  by  the  gramniai,  for 
'his  reason,  (113  has  iiiicicmablv  iilavcd  a  role  m  the  success  of  the  tlieoi  y.  Of  course,  this 
does  not  iiiran  that  some  other  iticorv  could  not  h.,!vc  done  ecjiiallv  well;  suc  h  congiarisons 
e.ie  jKileiiti.illy  Iruitiui  f'li!  are,  i>e\ond  the  scope  ol  l.tns  llussis.  In  any  even!,  some  torrri  o! 
grammar  miis!  be  adopted  toi  iinK'!ion;il  n.-a'-vins,  and  it  is  die  nature  of  .Nl,  Soar  tlial  Itie 
ciioiec-  wi!l  li,tve  empineal  eonsccuieiiees. 

Till,  le  IS  oiv  Mupoitaal  eonslmiiil  Nf  .  Soai  does  [ilace  on  'u amniai u  al  theory,  il  mtisl 
iiicdipomte  ,loealnv  echistraints  .tnd  l.ireak  long  distance  deiH‘rk.lene!c's  into  a  senes  ol  local 
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Table  9.4;  Varieties  of  learning  in  NL-Soar. 


1 .  New  operator  creation  (newly  created  u-constructors,  s-constructors) 

2.  Search  control  (search  control  for  ambiguity  resolution) 

.3.  Learning  from  external  input  (reference  resolution  chunks  encode  recognition 
memory  of  new  content) 

4.  Operator  application  (for  new  comprehension  operators) 

5.  Learning  from  failure  (from  constraint  checks  on  link  operators;  also  from  careful 
reprocessing  of  input  triggered  by  semantic  anomaly) 


relations.  GB  clearly  satisfies  this  constraint  with  its  chain  formation  subject  to  subjacency. 
To  see  why  NL-Soar  requires  locality,  consider  the  severe  extraction  violations  in  (242); 

(242)  *Who,  does  Phineas  know  a  boy  who  hates  the  man  who  saw  t,? 

The  severe  limitations  of  the  A/R  set  means  that  the  partial  syntactic  structure  may  not  be 
available  to  establi.sh  the  necessary  relations.  In  (242),  NL-Soar  is  unable  to  establish  the 
long  distance  relation  between  the  object  of  .vr/w  and  the  initial  who.  By  the  time  the  final 
embedded  clause  is  encountered,  the  intervening  CPs  will  have  pushed  the  matrix-level  CP 
from  the  AYR  set.  Thus,  the  crucial  spec-CP  position  occupied  by  who  will  not  be  available 
as  an  antecedent  tor  the  object  trace; 


ASSIGNERS 


spec-CT; 


1(7*  who  hates],  who  .van'] 


thus,  syntactic  interference  effects  in  short-term  memory  may  explain  why  there  is  a 
requirement  for  .sotae  locality  con.straint  on  grammatical  repre.senta'ion.  But  at  present  it 
does  not  seem  possible  to  derive  the  precise  form  of  subjacency  or  any  other  empiricady 
adequak.  locality  constraint. 


9,3.4  Learning 

I  , earning  [leiineates  every  aspect  of  Nl.  .Soar  One  ol  tlie  central  results  of  .Soar  researdi  is 
tliat  niair;  vai  lelies  ot  learning  may  emerge  Ire  oi  a  single  chunking  mei.  hanism  v\  oi  king  ( ivcr 
dilterent  kinds  ot  jiroblem  solving  impasses  (Newell,  1990;  Steiei  et  al.,  l9iS/l,  N!,  Sisii 
ilsei!  exhibits  several  kinds  ol  iearnmg.  siinniiari/eil  in  i  igure  0  4. 

C'.luinkiiie,  111  NL  .Soai  ak'-o  iai:>es  a  nuinliei  ol  seriuir.  issues,  soiiu.'  ot  wIikIi  remiiin 
to  lie  lesolvetl,  Ol  eoiirse.  there  rs  the  qne:>t!on  ol  whetber  cliUnkiiu.!  is  adeijuate  to  the 
lask  ol  language  .icrjuisitu.ni  The  iiatiiial  hvpotliesrs  ol  Nl,  .Soar  is  iliat  il.ie  Icxirrui  .iiul 
the  .angu.igc  speeilic  a\[)ecl'-  of  giainmar  .ue  all  ;ic(|uned  \  i:i  ciiunking  iN.-uell,  looo), 
I 'Itinialelv,  siu  b  a  llieoi  v  w  ill  place  .uidnu'iiai  r  ■. 'iisti .unis  on  tlie  nalai'.  oi  tlic  lowa  i 
piolileni  spacr's  tti.l!  ,'ie  now  sir.ipK  posited  in  Itold  die  gi  ,iiniiialie;il  know  l.‘(ige 
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Another  related  fundamental  issue  is  the  masking  effect.  We  saw  in  Chapter  4  that 
chunking  can  produce  a  recognition  memory  that  may  mask  knowledge  in  lower  spaces. 
This  is  clearly  an  i  nteresting  psycholinguistic  prediction,  but  it  raises  concerns  about  whether 
this  limitation  will  in  fact  prove  to  be  too  severe.  It  may  not  be  possible  to  settle  the  issue 
without  understanding  the  basic  structure  of  acquisition  in  NL-Soar. 


9.4  Related  theoretical  work 

This  section  compares  NL-Soar  with  a  few  closely  related  theories;  the  production-system 
model  of  Just  and  Carpenter,  and  the  principle-based  parsing  theories  of  Pritchett  and 
Gibson. 


9.4.1  READER  and  CC  READER 

The  NL-Soar  model  is  theoretically  closest  to  the  READER  and  CC  READER  production 
system  models  of  comprehension  (Thibadeau  et  al.,  1982;  Just  &  Carpenter,  1987;  Just  & 
Carpenter,  1992),  (The  ACT  comprehension  model  (Anderson  et  al.,  1977)  is  another  good 
example  of  a  model  based  on  a  production  ,system  architecture,  but  the  READER  models  are 
better  developed  with  respect  to  NL  processing).  There  are  many  .strong  similarities.  The 
READER  models  arc  built  upon  a  general  cognitive  aichitecture  (CAP,s),  just  as  NL-Soar  is 
based  on  Soar.  Both  Soar  and  CAP.s  have  productions  as  a  fundamental  component,  and  as 
a  result  both  NL-Soar  and  READER  embody  the  immediacy  hypothesis.  Both  NL-Soar  and 
READER  are  functionally  complete  models,  in  that  they  posit  processes  to  handle  multiple 
levels  of  comprelu  nsion  (syruaci  :,  .semantic,  referential).  Both  theories  al.so  model  certain 
aspects  of  the  time  com.se  <u' comprehension. 

l  iunigh  a  full  comparison  ol  .Soar  and  (  At’s  is  beyond  the  .scope  of  the  present  discussion, 
it  is  worth  noting  a  number  of  key  differences.  C.Ai’.s  is  activatiim-based,  and  therefore  deals 
vvitii  coiitmuoiisl  V  varvmg  production  strengths  and  memory  traces,  winie  the  match  in  Soar 
is  discrete  .Soar  [losils  .i  level  of  control  [milt  on  U  p  of  the  production  system  fprolriem 
spaces)  and  tlKue  oie  uUK'iiuccs  an  .lutomalie/deliberaie  distmclion  1  he  total  activation  in 
(  Al’S  may  I'C  .id|i!s|t’(l  .is  a  leficciion  ol  woiking  luenK'iy  c.ipacitv.  wink'  Soar  has  csscntialiv 
no  sliuctuiaj  [>ai  nnetne  vari.ilioii  Soar  lias  a  contiruious  learning,  mectianisiii,  winch  is 
.ibsciit  m  ( '.M'S. 

As  a  result  i'l  die  underh'mg  .rr^.  hitectuies  and  olluu  assumptions  of  the  models,  theic 
asc  sevvial  signilK'.im  l.iIt!eFenee.^  between  NL  so;u  and  ilie  Rt  ADl  H  models.  Nl.  Soar 
makes  precise  saiii'Uiie  seiisiiic'e  piedielioiis  ot  gaiden  path  eifeels  anti  j>arsiiig  bre.ik 
deivcii;  preseniK  il  is  iml  c  ksir  Rl  \|)M-c  could  l>e  a[)pi:ed  in  det.ui  to  die  eollca.  hou  ol 
eoiisl! iielions  lineiudiii!’  tiu-  e!0',s  liiiRUislic  e.Ntimples)  that  loinied,  .iii  ntijioit.tiu  eore  ol 
Nl.  So.ti  s  pi  eiii .  tion.-.  r.'n  [tieiclhci  .ti.ind,  .N!  So.n  e.imio!  \c-!  model  perlumi.nic'e 
I  a  His  due  iia  mdi iclu.d  didc'i  caicc'  m  oi  k  mg  memorc  e.ij'.n  iR, .  Bodi  Nl  .S(,.ii  ,uid  to  mur 
■  liouki  I'C  afile  ti>  mode!  individn..e  diiieieuec's  Isised  on  know  k'c-ipc'  (oi  ■^kdll  dilleieriees, 
h\  ( lu'.ism  ■  , ! ! !  t(.  K'l ii  ..s,  j ■,  ( i|  ()i  I  id'ue! n Mis  or  asst iei,-i!ions  Hi ii  .Ri  .\l  s'  l-’.  dot  s  noi  | u ox  ide  t lu' 
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mechanisms  (learning)  by  which  these  differences  might  arise  as  a  function  of  experience. 
Although  both  models  model  the  time  course  of  comprehension,  NL-Soar  cannot  mode!  the 
critical  differences  in  fixation  times  due  to  word  frequency  and  length  because  it  does  not 
have  a  detailed  model  of  lexica!  encoding  and  access.  However,  NL-Soar  is  able  to  make 
zero-parameter  predictions  of  comprehension  rate,  due  to  Soar’s  temporal  grounding. 

9.4.2  Pritchett’s  model 

As  far  as  garden  path  theories  are  concerned,  NL-Soar  bears  the  closest  resemblance  to 
Pritchett’s  On-line  Locality  Constraint  (OLLC)  (Pritchett,  1992).  Both  models  embody  the 
Structural  Garden  Path  Hypothesis  (54),  and  as  a  result  both  exhibit  extreme  sensitivity 
to  differences  in  syntactic  structure.  Indeed,  Pritchett’s  original  theory  (Pritchett,  1988) 
provided  the  inspiration  for  a  structural  repair  mechanism,  furthermore,  both  models  are 
purely  head-driven,  bottom-up  parsers 

There  are  several  similarities  and  differences  between  the  OLLC  and  NL-Soar's  re¬ 
pair  mechanism.  The  OLLC  is  essentially  an  abstract  characterization  of  precisely  what 
structural  differences  between  the  pursued  and  correct  interpretations  will  yield  garden 
path  effects.  As  such,  it  represents  the  Structural  Garden  Path  Hypothesis  in  its  purest 
form:  there  is  no  commitment  to  particular  computational  proc.esses,  or  even  to  a  single 
path/reanalysis  model.  The  NL-Soar  theory,  on  the  other  hand,  posits  an  explicit  .set  of 
functional  mechanisms  to  handle  unproblematic  ambiguities,  f.dlimately,  even  if  a  char¬ 
acterization  such  as  the  OLl.C  proves  correct,  there  must  be  some  account  given  of  how 
the  computational  processes  of  compreiicnsion  yield  such  a  characterization.  Of  course, 
Nf.-Soar  predicts  that  an  account  such  as  the  OLLC  must  ultimately  ire  right-  the  repair 
mechanism  in  Nl.  Soar  fails  or  succeeds  purely  as  a  function  of  the  dilTerences  between  the 
[uirsued  and  retjiiired  symactic  sirucliiies. 

Given  lhe.se  similarities,  :t  is  iiileresting  to  consider  the  status  of  the  OLl.C  as  a 
i]i\unrtHilicciU\  ilciiveil  theory.  If  structiiK'  sensitu  ity  is  all  tliat  is  requued,  NL  So.u  i.s 
jiist  as  gi  ammaticaily  based  .is  the  OLl  .C.  Hut  by  giamni.ilically  dei  i\ed,  Pritchett  (  1‘t'LM 
means  more  tli.iii  this;  the  ciiicial  fact  is  that  (he  t  )Ll  ( '  is  ioiiiiulated  m  temis  of  luiula 
mental  relations  (dtnnm.incc  and  cove; iiineiil )  of  the  !.’ramm.ii.  file  signilicaucc  o!  this 
I ormiil.it n III  is  imcicar,  liowiwiu  tin.'  [>ai iiciil.ir  tonn  ut  the  ( >1 .1 .(  (a  di.s)imctu)n  ol  .iuim 
nance  .md  government)  has  no  independent  grammatical  si.itus,  .ind  it  is  j)crh.t[is  somculuit 
odd  ;o  expect  am  gi.imm.itic.il  sigiiilic.ince  to  nhtam  lor  lel.itioiis  eompuud  m  toss  ihl- 
l('nnl  .v///(i  /no’x.  as  is  the  c.isr'  with  tin.'  ()l  1  ('  Uii  tin.’  ntlici  liaiul.  we  shvHild  expcst,  it 
the  .SiiiKtinai  (  i.inieii  I’.illi  lixpotlicsis  is  s()!rei.I,  iti.il  s<i»ic  furnml  itmii  ol  .i  lean.ih  ms 

i  nils! ;  .1111 1  sIk  iiiKi  he  po.'.silile  111  Ua  Ills  oi  gi  .'.I  I  limit  ic.il  pi  nml  i  \  es,  sine.,  il  is  pi  l-.  isel  \  sia  h 
pi  ill  I  III  \  es  1 1 1,1 1  pi  1  >\  ule  I  he  l.im  ii.ige  h  ii  i.ilki  ng  .ihniil  .0[H'CIs  .  >t  n.i.a  lu  slim  lui  e  !  I  ms, 

ii  IS  mil  s(ii|Mismg  ili.i!  ihe  lo.  ,i!u\  conslmiiit  tniill  niio  the  geiiciMici  lui  \l  snip 

ii|H  i.il.'i  c.iii  t'..-  le.idilx  li>i  mill, lie.!  Ill  leiins  .d  a  simple  '.M.mimatic.il  ..nisiiu.l  ii.tiiu  !\ 

I  he  m.i  M  m.il  pi .  i|eel  mu' 

I  'ih I.'!  I'l ,  \  Ii-'.'.  ihc  le'e.ilil’,  .  1  'iisli .liiit  .  'I  l!;.-  Mill'  I ') S  I  ,!Ii  '1  e.  in  ' ii.i) i  r \  en  m  ii-|  -  lei  I  h.ii i  I i le  i  >1  i  ' 

Is. !  Pi  !  1,  U  ,i[i  l.isi-  ,  a  ,iiip  lii.il  I  he,  '.111;'  !(■  II  .e.  Ii.ini'.n!  ,  , , , ;  1 '.  .ill  ihe  lii.-.  ,u  \  i  i  ■  ii  i  Is  .ei  1 1  n ;  'ii  i :  i,  i  il .  ■ 
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The  head-driven  assumption  of  Pritchett’s  model  was  adopted  in  NL-Soar  because  it 
yields  the  minimal  set  of  operators  to  construct  the  utterance  model:  no  additional  processes 
are  required  to  build  expectation  structures  corresponding  to  top-down  predictions,  and  no 
processes  are  required  to  match  incoming  structure  with  predicted  structure  (see  (Lehman 
et  al.,  1991a)  for  a  descr  iption  of  an  earlier  version  of  NL-Soar  that  did  have  an  expectation- 
based  component). 

While  NL-Soar  incorporates  head-driven  processing  and  embodies  the  structure-sen¬ 
sitivity  of  the  OLLC,  ii  liffers  significantly  from  the  Generalized  Theta  Attachment  (CTAj 
principle  which  governs  ambiguity  resolution  in  Pritchett’s  model.  As  noted  in  Chap¬ 
ter  2,  the  most  serious  problem  with  the  GTA  and  any  other  syntactically-driven  model 
of  ambiguity  resolution  is  the  inability  to  account  for  interactive  effects  that  have  been 
established  across  a  range  of  contexts,  structural  ambiguities,  and  experimental  paradigms. 
Nevertheless,  NL-Soar  does  share  some  predictions  for  attachment  preferences  with  GTA, 
in  particular,  the  preference  for  objects  over  subjects  (§4. 1 .3L 

9.4.3  Gibson’s  model 

Gibson’:;  1991  model  was  ihc  first  to  make  detailed  predictions  across  a  broad  range  of 
both  garden  path  and  parsing  breakdown  phenomena,  and  is  still  the  only  theory  apart  from 
NL  Soar  to  do  so. 

Like  NL-Soar  and  the  On-line  Locality  Constraint,  Gibson’s  model  is  a  structure 
sensitive  tlvaory.  It  incorporates  a  structural  metric  that  a.ssigns  values  to  alternative  inter¬ 
pretations  Garden  path  ett'ects  are  predicted  when  the  metric  assigned  to  two  interpretations 
ditfers  by  moic  than  a  C(>u:,lant  factor,  leading  to  the  disposal  of  one  of  die  interpretations. 

I  hus,  till.’  Gibson  moilcl  .ilso  emboilies  the  Structural  Garden  Paili  hypothesis. 

fliough  Gilison  presents  the  itieors  as  a  memory  overload  model,  n  is  unclear  how  to 
mterpi  et  tlie  tlieor  v  iii  meehaiiisiii-  terms,  t  course,  a  eoinputational  impicmcntalion  of  the 
tticoiA'  can  tic  consiraclcd  laud  ( iilisoii  dui  coiisnikt  one)  ilial  obtains  ctlicicncy  gams  by 
using  die  sii  iiciiiial  mciiic  lo  prune  die  paisiiig  .seaieh  sfiai'c  bill  such  an  implenieiitalion 
eaiinol  lie  taken  as  .i  rugintise  model  iii  the  slrom.ds  equivalent  seaise  ot  Pvlv'shvn  |  !dX  l), 
ineaiise  die  I'lovcsMiig  siiqis  iii  Ife  inipleinciitatioii  .ee  not  held  lo  riaiespoiid  lo  die 

(H occssiiig  sieps  die  iiiiinao  conipreluMuiei.  i  I  he  iiuplenienialuin  ap[ihes  tin.'  dieoiv 

slraighdoiw  aidl\  it  gk'iiei.iteN  l!ie  allemadvc  inlerpieiaiioiis,  eoiiqniles  the  lueU  ie,  .iiut 
discards  sli  iieim  es  acis  u  duig  to  die  pi  iimne  o.di  )  Ilia  au  hileeiui  al  stains  ul  die  \  ,ii  loas 
c.eigiils  leinams  niiexpiu  aUal  Ihus.  die  [iies(,-nt  iheoiviie.d  \ahie  o|  die  ineliie  is  no!  lo 
[h‘  loinid  Ml  Us  u'.ih/alioe  m  .i  pioci  ss  iiio,iei  ImiI  iiisir.td  in  iis  pres  isr'  irlieiihuion  o! 
ihe  sUiielvii.d  ddlen  iu  cs  that  leavi  lo  e-iiden  n ‘di  rllis  is,  this  ahsinni  oiicipieMiiuui  ot 
the  du'oi  \  ] daces  ii  in  die  s.n iie  genei .il  eias  ,  o|  duoi  i ^  as  die  ( )i  I  d  ,  tliou gh  die  (  )|  I  < 

.r  a  oiiipl !  slu's  I  he  I  nui  I  loi;  ,oiiu">\  h.it  more  U  .iiisjMi  eni  l\ 

I  '  '  I  i  ^  '  t  ill.  '  M  (  (  I  I  t.iN  I  .  i  f  Ii!  t  I'  in  ‘1  f  I  •  1 1  ‘  l»  i  i  I!  -  ll  I  f.ilLl'  M  .  i:U‘x  ll.i'  1  i-.,  I  i  '  I  nil  i .;!  !  i  '  '.fn|  ‘  t'.  W  cl !  .1  .!  1 '  i  ;  1 1 , 1 !  1  i  . !  1 1 
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NL-Soar's  explanation  of  processing  breakdown  is  siirilai  to  the  explanation  pro  .  i>iec.! 
by  Gibson’s  theory,  in  that  both  identify  the  problem  as  primarily  one  of  buffering  unin- 
terpreted  NPs.  Indeed,  Gibson’s  analyses  paved  the  way  for  the  account  presented  here, 
Again,  however,  the  structural  metric  :riust  eventually  be  given  a  proces.sing  interprctati.rn. 
NL-Soar’s  A/R  set  orovides  such  a  mechanistic  theory  (though  not  one  that  directly  realizes 
Gibson’s  metric — the  two  theories  do  in  fact  make  different  predictions).  Furthermore, 
the  A/R  set  has  some  independent  functional  and  computational  motivation — and  perhaD.s 
psychological  motivation,  as  discussed  above  (§9.3.1). 


9.5  Challenging  areas  for  tutyre  work 


NL-Soar  not  only  raises  many  challenging  issues  to  resolve  in  the  current  model,  but 
potentially  opens  up  new  areas  of  theoretic.ai  and  emprical  inquiry.  This  section  discns.ses 
just  a  few  of  these  issues  and  area.c 


First,  of  course,  are  the  several  empirical  problems  the  medei  encounters  on  the  presCiii 
corp.ora.  A  number  of  tnissed  predictions  point,  to  a,  pos..db(e  problem  with  the  way  NL 
Soar  handles  cornolements  vs.  adjuncts  The  model  also  appears  to  somewhat  overpredict 
acceptability  of  difficult  embeddings.  Here,  the  challenge  wiH  not  just  involve  modifying 
the  theory,  but  acqui.ving  snore  relial  fe  data  ;ts  well. 

NL-So'ir  is  one  of  the  few  ps''choliagui.siic  models  of  .sentence  processing  to  incorporate 
continuous  learning  (beyond  acqui.sitson)  as  a  cc.hra!  component.  This  feature,  along  with 
the  atJfomatic''dciiberate  distincrion  inherent  in  NI  ‘toar,  rna>  provide  a  way  to  unify  the 
increa.sing  amount  of  psychological  data  addressing  this  distinction.  The  deliberate  garden 
path  recovery  model  piesented  in  Chapter  4  is  just  one  e.xample  of  how  the  theory  can  lie 
applied,  h  i., -Sear  could,  m  geat'/al,  open  up  new  area.s  of  study  c oncer amg  the  impact  ol 
icarnirig  on  v.irious  aspects  of  panning  and  imerprefvdor;. 

Becau;.':'  NL,  So,:ir  is  ern'oedded  m  a  general  cognitive  theory,  it  offers  the  oppouunity 
(o  study  the  iiHcgran.vi  of  iangnage  cotuimehertsson  with  other  tasks,  including  language 
i^enerafitin.  in  artificial  intelligence,  the  integration  of  r*  irnprehcosion  is  already  under  way 
vs'itii  F  bluin', %  ( loy'j)  work  on  ins.riK'tabie  agents.  Other  work  m  the  N.L-Soar  project 
a!  C’Mi.'  IS  investigating  the  iow-levc!  i.nteileavaig  of  coniprciscasion  and  generation  with 
olhs'i  ‘ask  oper,.)l{.'rs  in  tfic  dematuhng.  rs.'ai  I  into  citvironnicnt. '  if  tia.  in' AS  A  lesi  director  (the 
iiulividual  reqton.sibie  (or  lauiiciiiiig  the  .space  shuttiei  (Ni-.-iS<..<n  c!  af,  1904).  .Apart  from 
the  iCiil-tinK'  mtt'giaiion  <•'  cnn)[>Fi,'fij.n,ston  and  gcnciat'On  piocc  ,se.s,,  t.he  generalion  work 


soiiils  the  exploration  cit  siiareo  knowiedge  '•!.urec>  and  wo;}, uw  ineuiciry  struclurcs. 

i  iii.d!\,  the  very  Minjiic  slructhte  rT  the  A;'R  scr  makes  d  a  goc'd  .aralidate  for  exploring 
.air, si  iaaind-tUOiiN  aad  9c v clipping  ncio  ork  sn'ipleiviematKsur-.  ssoracswold,  A Ipcri,  R.iusch,, 
id  t  ajdan  fuive  eircadv  mui ita.i  i.in  muiginu  expenmems  (k'F'Ll  with  \ub)ec!s 

adine  cmoe(.L.ic(!  clauses  Kicnofv'ne  a  p^ient 
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9.6  Contributions  and  conclusion 

This  thesis  began  by  setting  forth  four  criteiia  for  the  comprehension  model  to  be  de¬ 
veloped;  breadth,  depth,  architectural  basis,  and  functionality.  NL-Soar  sati-ifies  all  four 
Tteria.  It  covers  a  broad  range  of  sentence  processing  phenomena;  garden  path  and 
unproblenuitic  ambiguities,  difficult  and  acceptable  embeddings,  modular  and  interactive 
ambiguii)  resc.utiori  effects,  immediacy  of  interpretation,  and  the  time  course  of  compre¬ 
hension.  it  accounts  for  the  phenomena  in  depth;  the  theory  makes  successful  predictions 
on  a  collection  of  over  100  items,  including  cross-linguistic  constructions.  The  theory  has 
an  archiicctund  basis;  the  Soar  cognitive  architecture,  which  provides  the  control  structure, 
memory  structures,  and  learning  mechanism.  Finally,  the  theory  is  functional;  the  model 
po.sirs  computational  mechanisms  that  realize  the  functions  of  comprehension  at  riieltiple 
level'  and  the  model  functions  as  a  working  comprehension  system. 

Tf  .e  architectural  grounding  proved  to  be  theoretically  and  empirically  fecund,  contribut  ¬ 
ing  to  a  number  of  firsts  for  tfie  model:  for  example,  the  first  zero-parameter  predictions  of 
comprehension  rate,  the  first  detailed  model  of  deliberate  recovery  from  garden  path;>,  and 
the  first  model  of  how  learning  might  modulate  modularity. 

In  short,  KT  .-Soar  provides  new  understanding  of  how  human  language  comprehension 
can  be  immeti  ate  and  real-time,  yet  extremely  flexible;  how  it  appears  to  effortlessly  handle 
local  ambiguiti;  s  and  embeddings  most  of  the  time,  yet  fail  in  certain  situatiicns,  how  it 
can  be  speci^d-purpose  and  finely  tuned  to  the  task,  yet  tightly  integrated  with  the  rest  of 
cognition;  .md  liov'  it  can  all  be  assembled  from  basic  computational  mechanisms  that  are 
fundamental  >  explaining  in:my  other  aspects  of  intelligent  behavior. 
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